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ENDOGENOUS AND NON-ENDOGENOUS VERSIONS OF 
HUMAN G PROTEIN-C OUPEED RECEPTORS 



FIELD OF THE INVENTION 



The invention disclosed in this patent document relates to transmembrane 
receptors, and more particularly to human G protein-coupled receptors, and 
specifically to endogenous human GPCRs with particular emphasis on non- 
endogenous versions of the GPCRs that have been altered to establish or enhance 
10 constitutive activity of the receptor. Preferably, the altered GPCRs are used for the 



Although a number of receptor classes exist in humans, by far the most abundant 
and therapeutically relevant is represented by the G protein-coupled receptor (GPCR or 
GPCRs) class. It is estimated that there are some 100,000 genes within the human 

20 genome, and of these, approximately 2%, or 2,000 genes, are estimated to code for 
GPCRs. Receptors, including GPCRs, for which the endogenous ligand has been 
identified are referred to as "known" receptors, while receptors for which the 
endogenous ligand has not been identified are referred to as "orphan" receptors. GPCRs 
represent an important area for the development of pharmaceutical products: from 

25 approximately 20 of the 100 known GPCRs, approximately 60% of all prescription 
pharmaceuticals have been developed. 

GPCRs share a common structural motif All these receptors have seven 
sequences of between 22 to 24 hydrophobic amino acids that form seven alpha helices, 
each of which spans the membrane (each span is identified by number, i.e., 

30 transmembrane- 1 (TM-1), transmebrane-2 (TM-2), etc.). The transmembrane helices 
are joined by strands of amino acids between transmembrane-2 and transmembrane-3, 




id ate compounds as receptor agonists, inverse agonists or 



partial agonists having potential applicability as therapeutic agents. 
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transmembrane-4 and transmembrane-5, and transmembrane-6 and transmembrane-7 on 
the exterior, or "extracellular" side, of the cell membrane (these are referred to as 
"extracellular" regions 1, 2 and 3 (EC-1, EC-2 and EC-3), respectively). The 
transmembrane helices are also joined by strands of amino acids between 
5 transmembrane- 1 and transmembrane-2, transmembrane-3 and transmembrane-4, and 
transmembrane-5 and transmembrane-6 on the interior, or "intracellular" side, of the cell 
membrane (these are referred to as "intracellular" regions 1, 2 and 3 (IC-1, IC-2 and IC- 
3), respectively). The "carboxy" ("C") terminus of the receptor lies in the intracellular 
space within the cell, and the "amino" ("N") terminus of the receptor lies in the 

1 0 extracellular space outside of the cell 

Generally, when an endogenous ligand binds with the receptor (often referred to 
as "activation" of the receptor), there is a change in the conformation of the intracellular 
region that allows for coupling between the intracellular region and an intracellular "G- 
protein." It has been reported that GPCRs are "promiscuous" with respect to G proteins, 

15 i.e., that a GPCR can interact with more than one G protein. See, Kenakin, T., 43 Life 
Sciences 1095 (1 988). Although other G proteins exist, currently, Gq, Gs, Gi, Gz and Go 
are G proteins that have been identified. Endogenous ligand-activated GPCR coupling 
with the G-protein begins a signaling cascade process (referred to as "signal 
transduction")* Under normal conditions, signal transduction ultimately results in 

20 cellular activation or cellular inhibition. It is thought that the IC-3 loop as well as the 
carboxy terminus of the receptor interact with the G protein. 

Under physiological conditions, GPCRs exist in the cell membrane in 
equilibrium between two different conformations: an "inactive" state and an "active" 
state. A receptor in an inactive state is unable to link to the intracellular signaling 

25 transduction pathway to produce a biological response. Changing the receptor 
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conformation to the active state allows linkage to the transduction pathway (via the G- 
protein) and produces a biological response. 

A receptor may be stabilized in an active state by an endogenous ligand or a 
compound such as a drug. Recent discoveries, including but not exclusively limited to 
5 modifications to the amino acid sequence of the receptor, provide means other than 
endogenous ligands or drugs to promote and stabilize the receptor in the active state 
conformation. These means effectively stabilize the receptor in an active state by 
simulating the effect of an endogenous ligand binding to the receptor. Stabilization by 
such ligand-independent means is termed "constitutive receptor activation.'" 

10 

SUMMARY OF THE INVENTION 

Disclosed herein are endogenous and non-endogenous versions of human 
GPCRs and uses thereof. 

1 5 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 provides an illustration of second messenger IP 3 production from 
endogenous version RUP12 ("RUP^") as compared with the control ("CMV"). 

Figure 2 is a graphic representation of the results of a second messenger cell- 
based cyclic AMP assay providing comparative results for constitutive signaling of 
20 endogenous RUP 1 3 r RUP 1 3") and a control vector ("CMV"). 

Figure 3 is a diagrammatic representation of the signal measured comparing 
CMV, endogenous RUP13 ("RUP 13 wt") and non-endogenous, constitutively activated 
RUP 13 ("RUP13(A268KD, utilizing 8XCRE-Luc reporter plasmid. 
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Figure 4 is a graphic representation of the results of a [ 3:> S]GTPyS assay 
providing comparative results for constitutive signaling by RUP13:Gs Fusion Protein 
("RUPI3-Gs") and a control vector ("CMV"). 

Figure 5 is a diagrammatic representation of the signal measured comparing 
5 CMV, endogenous RUP14 ("RUP14 wt") and non-endogenous, constitutively activated 
RUP13 ("RUP14(L246K)"), utilizing 8XCRE-Luc reporter plasmid. 

Figure 6 is a diagrammatic representation of the signal measured comparing 
CMV, endogenous RUP15 ("RUP15 wt") and non-endogenous, constitutively activated 
RUP15 ("RUP15(A398K)"), utilizing 8XCR£-Luc reporter plasmid. 
10 Figure 7 is a graphic representation of the results of a second messenger cell- 

based cyclic AMP assay providing comparative results for constitutive signaling of 
endogenous RUP15 ("RUP15 wt"), non-endogenous, constitutively activated version of 
RUP15 ("RUP15( A398K)") and a control vector ("CMV"). 

Figure 8 is a graphic representation of the results of a [ 35 S]GTPyS assay 
15 providing comparative results for constitutive signaling by RUP15:Gs Fusion Protein 
("RUP1 5-Gs") and a control vector ("CMV"). 

Figure 9 provides an illustration of second messenger LP 3 production from 
endogenous version RUP17 ("RUP17") as compared with the control ("CMV"). 

Figure 10 provides an illustration of second messenger IP3 production from 
20 endogenous version RUP21 ("RUP21") as compared with the control ("CMV"). 

Figure 11 is a diagrammatic representation of the signal measured comparing 
CMV, endogenous RUP23 ("RUP23 wt") and non-endogenous, constitutively activated 
RUP23 ("RUP23(W275K)"), utilizing SXCRE-Luc reporter plasmid. 
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Figure 12 is a graphic representation of results from a primary screen of several 
candidate compounds against RUP13; results for "Compound A" are provided in well 
A2 and "Compound "B" arc provided in well G9. 



DETAILED DESCRIPTION 

The scientific literature that has evolved around receptors has adopted a number 
of terms to refer to ligands having various effects on receptors. For clarity and 
consistency, the following definitions will be used throughout this patent document. To 
the extent that these definitions conflict with other definitions for these terms, the 
following definitions shall control: 

AGONISTS shall mean materials (e.g., ligands, candidate compounds) that 

activate the intracellular response when they bind to the receptor, or enhance GTP 

binding to membranes. 

AMINO ACID ABBREVIATIONS used herein are set out in Table A: 

TABLE A 



ALANINE 


ALA 


A 


ARGININE 


ARG 


R 


ASPARAGINE 


ASN 


N 


ASPARTIC ACID 


ASP 


D 


CYSTEINE 


CYS 


C 


GLUTAMIC ACID 


GLU 


E 


GLUTAMINE 


GLN 


Q 


GLYCINE 


GLY 


G 


HISTTDINE 


HIS 


H 


ISOLEUCINE 


ILE 


I 


LEUCINE 


LEU 


L 


LYSINE 


LYS 


K 


METHIONINE 


MET 


M 



w o oi/3<>-m 



PC l /l Sot) 315(1') 



PHENYLALANINE 


PHE 


F 


PROLINE 


PRO 


P 


SERINE 


SER 


S 


THREONINE 


TIIR 


T 


TRYPTOPHAN 


TRP 


W 


TYROSINE 


TYR 


Y 


VALINE 


VAJL 


V 



PARTIAL AGONISTS shall mean materials (e.g., ligands, candidate 
compounds) that activate the intracellular response when they bind to the receptor to a 
lesser degree/extent than do agonists, or enhance GTP binding to membranes to a lesser 
5 degree/extent than do agonists. 

ANTAGONIST shall mean materials (e.g., ligands, candidate compounds) that 
competitively bind to the receptor at the same site as the agonists but which do not 
activate the intracellular response initiated by the active form of the receptor, and can 
thereby inhibit the intracellular responses by agonists or partial agonists. 
1 0 ANTAGONISTS do not diminish the baseline intracellular response in the absence of an 
agonist or partial agonist. 

CANDIDATE COMPOUND shall mean a molecule (for example, and not 
limitation, a chemical compound) that is amenable to a screening technique. Preferably, 
the phrase "candidate compound" does not include compounds which were publicly 
1 5 known to be compounds selected from the group consisting of inverse agonist, agonist or 
antagonist to a receptor, as previously determined by an indirect identification process 
("indirectly identified compound"); more preferably, not including an indirectly 
identified compound which has previously been determined to have therapeutic efficacy 
in at least one mammal; and, most preferably, not including an indirectly identified 
20 compound which has previously been determined to have therapeutic utility in humans. 
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COMPOSITION means a material comprising at least one component; a 
"pharmaceutical composition" is an example of a composition. 

COMPOUND EFFICACY shall mean a measurement of the ability of a 
compound to inhibit or stimulate receptor functionality, as opposed to receptor binding 
5 affinity. Exemplary means of detecting compound efficacy are disclosed in the Example 
section of this patent document. 

CODON shall mean a grouping of three nucleotides (or equivalents to 
nucleotides) which generally comprise a nucleoside (adenosine (A), guanosine (G), 
cytidinc (C), undine (U) and thymidine (T)) coupled to a phosphate group and which, 
10 when translated, encodes an amino acid. 

CONSTITUTIVELY ACTIVATED RECEPTOR shall mean a receptor 
subject to constitutive receptor activation. A constitutively activated receptor can be 
endogenous or non-endogenous. 

CONSTITUTIVE RECEPTOR ACTIVATION shall mean stabilization of a 
15 receptor in the active state by means other than binding of the receptor with its 
endogenous ligand or a chemical equivalent thereof. 

CONTACT or CONTACTING shall mean bringing at least two moieties 
together, whether in an in vitro system or an in vivo system. 

DIRECTLY IDENTIFYING or DIRECTLY IDENTIFIED, in relationship to 
20 the phrase "candidate compound", shall mean the screening of a candidate compound 
against a constitutively activated receptor, preferably a constitutively activated orphan 
receptor, and most preferably against a constitutively activated G protein-coupled cell 
surface orphan receptor, and assessing the compound efficacy of such compound. This 
phrase is, under no circumstances, to be interpreted or understood to be encompassed by 
25 or to encompass the phrase "indirectly identifying" or "indirectly identified." 
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ENDOGENOUS shall mean a material that a mammal naturally produces. 
ENDOGENOUS in reference to, for example and not limitation, the term "receptor," 
shall mean that which is naturally produced by a mammal (for example, and not 
limitation, a human) or a virus. By contrast, the term NON-ENDOGENOUS in this 
5 context shall mean that which is not naturally produced by a mammal (for example, and 
not limitation, a human) or a virus. For example, and not limitation, a receptor which is 
not constitutively active in its endogenous form, but when manipulated becomes 
constitutively active, is most preferably referred to herein as a "non-endogenous, 
constitutively activated receptor." Both terms can be utilized to describe both "in vivo" 

10 and "in vitro" systems. For example, and not limitation, in a screening approach, the 
endogenous or non-endogenous receptor may be in reference to an in vitro screening 
system. As a further example and not limitation, where the genome of a mammal has 
been manipulated to include a non-endogenous constitutively activated receptor, 
screening of a candidate compound by means of an in vivo system is viable. 

15 G PROTEIN COUPLED RECEPTOR FUSION PROTEIN and GPCR 

FUSION PROTEIN, in the context of the invention disclosed herein, each mean a non- 
endogenous protein comprising an endogenous, constitutively activate GPCR or a non- 
endogenous, constitutively activated GPCR fused to at least one G protein, most 
preferably the alpha (a) subunit of such G protein (this being the subunit that binds 

20 GTP), with the G protein preferably being of the same type as the G protein that 
naturally couples with endogenous orphan GPCR. For example, and not limitation, in an 
endogenous state, if the G protein "Gsa" is the predominate G protein that couples with 
the GPCR, a GPCR Fusion Protein based upon the specific GPCR would be a non- 
endogenous protein comprising the GPCR fused to Gsct; in some circumstances, as will 

25 be set forth below, a non-predominant G protein can be fused to the GPCR. The G 
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protein can be fused directly to the c-terminus of the constitutively active GPCR or there 
may be spacers between the two. 

HOST CELL shall mean a cell capable of having a Plasmid and/or Vector 
incorporated therein. In the case of a prokaryotic Host Cell, a Plasmid is typically 
5 replicated as a autonomous molecule as the Host Cell replicates (generally, the Plasmid 
is thereafter isolated for introduction into a eukaryotic Host Cell); in the case of a 
eukaryotic Host Cell, a Plasmid is integrated into the cellular DNA of the Host Cell such 
that when the eukaryotic Host Cell replicates, the Plasmid replicates. Preferably, for the 
purposes of the invention disclosed herein, the Host Cell is eukaryotic, more preferably, 
10 mammalian, and most preferably selected from the group consisting of 293, 293T and 
COS-7 cells. 

INDIRECTLY IDENTIFYING or INDIRECTLY IDENTIFIED means the 
traditional approach to the drug discovery process involving identification of an 
endogenous ligand specific for an endogenous receptor, screening of candidate 
15 compounds against the receptor for determination of those which interfere and/or 
compete with the ligand-receptor interaction, and assessing the efficacy of the compound 
for affecting at least one second messenger pathway associated with the activated 
receptor. 

INHIBIT or INHIBITING, in relationship to the term "response" shall mean 
20 that a response is decreased or prevented in the presence of a compound as opposed to in 
the absence of the compound. 

INVERSE AGONISTS shall mean materials (e.g., ligand, candidate compound) 
which bind to either the endogenous form of the receptor or to the constitutively 
activated form of the receptor, and which inhibit the baseline intracellular response 
25 initiated by the active form of the receptor below the normal base level of activity which 
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is observed in the absence of agonists or partial agonists, or decrease GTP binding to 
membranes. Preferably, the baseline intracellular response is inhibited in the presence of 
the inverse agonist by at least 30%, more preferably by at least 50%, and most preferably 
by at least 75%, as compared with the baseline response in the absence of the inverse 
5 agonist. 

KNOWN RECEPTOR shall mean an endogenous receptor for which the 
endogenous ligand specific for that receptor has been identified. 

LIGAND shall mean an endogenous, naturally occurring molecule specific for 
an endogenous, naturally occurring receptor. 

10 MUTANT or MUTATION in reference to an endogenous receptor's nucleic 

acid and/or amino acid sequence shall mean a specified change or changes to such 
endogenous sequences such that a mutated form of an endogenous, non-constitutively 
activated receptor evidences constitutive activation of the receptor. In terms of 
equivalents to specific sequences, a subsequent mutated form of a human receptor is 

15 considered to be equivalent to a first mutation of the human receptor if (a) the level of 
constitutive activation of the subsequent mutated form of a human receptor is 
substantially the same as that evidenced by the first mutation of the receptor; and (b) the 
percent sequence (amino acid and/or nucleic acid) homology between the subsequent 
mutated form of the receptor and the first mutation of the receptor is at least about 80%, 

20 more preferably at least about 90% and most preferably at least 95%. Ideally, and owing 
to the fact that the most preferred cassettes disclosed herein for achieving constitutive 
activation includes a single amino acid and/or codon change between the endogenous 
and the non-endogenous forms of the GPCR, the percent sequence homology should be 
at least 98%. 
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NON-ORPHAN RECEPTOR shall mean an endogenous naturally occurring 
molecule specific for an endogenous naturally occurring ligand wherein the binding of a 
ligand to a receptor activates an intracellular signaling pathway. 

ORPHAN RECEPTOR shall mean an endogenous receptor for which the 
5 endogenous ligand specific for that receptor has not been identified or is not known. 

PHARMACEUTICAL COMPOSITION shall mean a composition 
comprising at least one active ingredient, whereby the composition is amenable to 
investigation for a specified, efficacious outcome in a mammal (for example, and not 
limitation, a human). Those of ordinary skill in the art will understand and appreciate the 
10 techniques appropriate for determining whether an active ingredient has a desired 
efficacious outcome based upon the needs of the artisan. 

PLASMID shall mean the combination of a Vector and cDNA. Generally, a 
Plasmid is introduced into a Host Cell for the purposes of replication and/or expression 
of the cDNA as a protein. 
15 SECOND MESSENGER shall mean an intracellular response produced as a 

result of receptor activation. A second messenger can include, for example, inositol 
triphosphate (IP3), diacycglycerol (DAG), cyclic AMP (cAMP), and cyclic GMP 
(cGMP). Second messenger response can be measured for a determination of receptor 
activation. In addition, second messenger response can be measured for the direct 
20 identification of candidate compounds, including for example, inverse agonists, agonists, 
partial agonists and antagonists. 

STIMULATE or STIMULATING, in relationship to the term "response" shall 
mean that a response is increased in the presence of a compound as opposed to in the 
absence of the compound. 
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VECTOR in reference to cDNA shall mean a circular DNA capable of 
incorporating at least one cDNA and capable of incorporation into a Host Cell. 

The order of the following sections is set forth for presentational efficiency and 
is not intended, nor should be construed, as a limitation on the disclosure or the claims to 
5 follow. 

A. Introduction 

The traditional study of receptors has always proceeded from the a priori 
assumption (historically based) that the endogenous ligand must first be identified before 

10 discovery could proceed to find antagonists and other molecules that could affect the 
receptor. Even in cases where an antagonist might have been known first, the search 
immediately extended to looking for the endogenous ligand. This mode of thinking has 
persisted in receptor research even after the discovery of constitutively activated 
receptors. What has not been heretofore recognized is that it is the active state of the 

15 receptor that is most useful for discovering agonists, partial agonists, and inverse 
agonists of the receptor. For those diseases which result from an overly active receptor 
or an under-active receptor, what is desired in a therapeutic drug is a compound which 
acts to diminish the active state of a receptor or enhance the activity of the receptor, 
respectively, not necessarily a drug which is an antagonist to the endogenous ligand. 

20 This is because a compound that reduces or enhances the activity of the active receptor 
state need not bind at the same site as the endogenous ligand. Thus, as taught by a 
method of this invention, any search for therapeutic compounds should start by 
screening compounds against the ligand-independent active state. 

25 B. Identification of Human GPCRs 
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The efforts of the Human Genome project has led to the identification of a 
plethora of information regarding nucleic acid sequences located within the human 
genome; it has been the case in this endeavor that genetic sequence information has been 
made available without an understanding or recognition as to whether or not any 
5 particular genomic sequence does or may contain open-reading frame information that 
translate human proteins. Several methods of identifying nucleic acid sequences within 
the human genome are within the purview of those having ordinary skill in the art. For 
example, and not limitation, a variety of human GPCRs, disclosed herein, were 
discovered by reviewing the GenBank™ database. Table B, below, lists several 
10 endogenous GPCRs that we have discovered, along with other GPCR's that are 
homologous to the disclosed GPCR. 

TABLE B 



Disclosed 
Human 
Orphan GPCRs 


Accession 
Number 
Identified 


Open Reading 
Frame 
(Base Pairs) 


Reference To 
Homologous 
GPCR 


Per Cent 
Homology 
To Designated 
GPCR 


hRl!P8 


AL121755 


l,152bp 


NPY2R 


27° o 


hRlTP9 


ACO 1 13375 


l,260bp 


GAL2R 


22% 


hRlTPlO 


AC008745 


l,014bp 


C5aR 


40% 


hRUPl 1 


ACO 13396 


l,272bp 


HM74 


36" o 


hRUPl 2 


AP000808 


966bp 


Masl 


34% 


hRUPl 3 


ACO 1 1780 


K356bp 


Fish GPRX- 
ORYLA 


43° o 


hRUPl 4 


AL137118 


1.041 bp 


CysLTIR 


35% 


hRUPl 5 


AL016468 


l,527bp 


RE2 


30% 


hRUPl 6 


AL 136 106 


U>68bp 


GLR101 


37° o 


hRUPl 7 


AC023078 


%9bp 


Masl 


37 v ., 


hRUP18 


AC008547 


U05bp 


Oxytocin 


31% 


hRUPl 9 


AC026331 


1,041 bp 


HM74 


52% 


hRUP20 


AL 16 1458 


1,01 lbp 


GPR34 


25% 


hRUP21 


AC026756 


l,014bp 


P2Y1R 


37% 


hRUP22 


AC027026 


993bp 


RUP17 
Masl 


67% 
37% 
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HRUP23 


AC007104 


1 ,092bp 


Rat GPR26 


31% 


HRUP24 


AL355388 


1.125bp 


SAL PR 


44% 


hRUP25 


AC026331 


l,092bp 


1UV174 


95% 


hRUP26 


AC023040 


1 ,()44bp 


Rabbit 5HT1D 


27° « 


HRIJP27 


AC027643 


158,700 


MCH 


38° o 



Receptor homology is useful in terms of gaining an appreciation of a role of the 
receptors within the human body. As the patent document progresses, we will disclose 
techniques for mutating these receptors to establish non-endogenous, constitutively 
5 activated versions of these receptors. 

The techniques disclosed herein have also been applied to other human, orphan 
GPCRs known to the art, as will be apparent as the patent document progresses. 

C. Receptor Screening 

10 Screening candidate compounds against a non-endogenous, constitutively 

activated version of the human GPCRs disclosed herein allows for the direct 
identification of candidate compounds which act at this cell surface receptor, without 
requiring use of the receptor's endogenous ligand. Using routine, and often 
commercially available techniques, one can determine areas within the body where the 

15 endogenous version of human GPCRs disclosed herein is expressed and/or over- 
expressed. It is also possible using these techniques to determine related 
disease/disorder states which are associated with the expression and/or over-expression 
of the receptor; such an approach is disclosed in this patent document. 

With respect to creation of a mutation that may evidence constitutive activation 

20 of the human GPCR disclosed herein is based upon the distance from the proline residue 
at which is presumed to be located within TM6 of the GPCR; this algorithmic technique 
is disclosed in co-pending and commonly assigned patent document PCT Application 



\\ () ni 3<-TI P( II Son 3150*) 

Number PCTTJSW '23938, published as WO 00/22129 on April 20, 2000, which, along 
with the other patent documents listed herein, is incorporated herein by reference. The 
algorithmic technique is not predicated upon traditional sequence "alignment'' but rather 
a specified distance from the aforementioned TM6 proline residue (or, of course, 
5 endogenous constitutive substitutionf for such proline residue). By mutating the amino 
acid residue located 16 amino acid residues from this residue (presumably located in the 
IC3 region of the receptor) to, most preferably, a lysine residue, such activation may be 
obtained. Other amino acid residues may be useful in the mutation at this position to 
achieve this objective. 

10 

D. Disease/Disorder Identification and/or Selection 

As will be set forth in greater detail below, most preferably inverse agonists and 
agonists to the non-endogenous, constitutively activated GPCR can be identified by the 
methodologies of this invention. Such inverse agonists and agonists are ideal candidates 

1 5 as lead compounds in drug discovery programs for treating diseases related to this 
receptor. Because of the ability to directly identify inverse agonists to the GPCR, 
thereby allowing for the development of pharmaceutical compositions, a search for 
diseases and disorders associated with the GPCR is relevant. For example, scanning 
both diseased and normal tissue samples for the presence of the GPCR now becomes 

20 more than an academic exercise or one which might be pursued along the path of 
identifying an endogenous ligand to the specific GPCR. Tissue scans can be conducted 
across a broad range of healthy and diseased tissues. Such tissue scans provide a 
preferred first step in associating a specific receptor with a disease and/or disorder. 

Preferably, the DNA sequence of the human GPCR is used to make a probe for 

25 (a) dot-blot analysis against tissue-mRNA, and/or (h) RT-PCR identification of the 
expression of the receptor in tissue samples. The presence of a receptor in a tissue 
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source, or a diseased tissue, or the presence of the receptor at elevated concentrations in 
diseased tissue compared to a normal tissue, can he preferably utilized to identify a 
correlation with a treatment regimen, including but not limited to, a disease associated 
with that disease. Receptors can equally well be localized to regions of organs by this 
5 technique. Based on the known functions of the specific tissues to which the receptor is 
localized, the putative functional role of the receptor can be deduced. 

E. Screening of Candidate Compounds 

1. Generic GPCR screening assay techniques 

1 0 When a G protein receptor becomes constitutively active, it binds to a G protein 

(e.g., Gq, Gs, Gi, Gz, Go) and stimulates the binding of GTP to the G protein. The G 
protein then acts as a GTPase and slowly hydrolyzes the GTP to GDP, whereby the 
receptor, under normal conditions, becomes deactivated. However, constitutively 
activated receptors continue to exchange GDP to GTP. A non-hydrolyzable analog of 

15 GTP. f 35 SlGTPvS. can be used to monitor enhanced bindine to membranes which 
express constitutively activated receptors. It is reported that [ 35 S]GTPyS can be used to 
monitor G protein coupling to membranes in the absence and presence of ligand. An 
example of this monitoring, among other examples well-known and available to those in 
the art, was reported by Traynor and Nahorski in 1995. The preferred use of this assay 

20 system is for initial screening of candidate compounds because the system is generically 
applicable to all G protein-coupled receptors regardless of the particular G protein that 
interacts with the intracellular domain of the receptor. 

2. Specific GPCR screening assay techniques 

Once candidate compounds are identified using the "generic" G protein-coupled 
25 receptor assay (i.e., an assay to select compounds that are agonists, partial agonists, or 
inverse agonists), further screening to confirm that the compounds have interacted at the 
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receptor site is preferred. For example, a compound identified by the "generic" assay 
may not bind to the receptor, but may instead merely "uncouple" the G protein from the 
intracellular domain. 

a. Gs, Gz and Gi. 

Gs stimulates the enzyme adenylyl cyclase. Gi (and Gz and Go), on the other 
hand, inhibit this enzyme. Adenylyl cyclase catalyzes the conversion of ATP to cAMP; 
thus, constitutively activated GPCRs that couple the Gs protein are associated with 
increased cellular levels of cAMP. On the other hand, constitutively activated GPCRs 
that couple Gi (or Gz, Go) protein are associated with decreased cellular levels of c AMP. 
See, generally, ''Indirect Mechanisms of Synaptic Transmission," Chpt. 8, From Neuron 
To Brain (3 rd Ed.) Nichols, J.G. et al eds, Sinauer Associates, Inc. (1992). Thus, assays 
that detect cAMP can be utilized to determine if a candidate compound is, e.g., an 
inverse agonist to the receptor (i.e., such a compound would decrease the levels of 
cAMP). A variety of approaches known in the art for measuring cAMP can be utilized; 
a most preferred approach relies upon the use of anti-cAMP antibodies in an ELISA- 
based format. Another type of assay that can be utilized is a whole cell second 
messenger reporter system assay. Promoters on genes drive the expression of the 
proteins that a particular gene encodes. Cyclic AMP drives gene expression by 
promoting the binding of a cAMP-responsive DNA binding protein or transcription 
factor (CREB) that then binds to the promoter at specific sites called cAMP response 
elements and drives the expression of the gene. Reporter systems can be constructed 
which have a promoter containing multiple cAMP response elements before the reporter 
gene, e.g., p-galactosidase or luciferase. Thus, a constitutively activated Gs-linked 
receptor causes the accumulation of cAMP that then activates the gene and expression of 
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the reporter protein. The reporter protein such as p-galactosidase or luciferase can then 
be detected using standard biochemical assays (Chen et al. 1995). 
b. Go and Gq. 

5 Gq and Go are associated with activation of the enzyme phospholipase C, which 

in turn hydrolyzes the phospholipid PIP2, releasing two intracellular messengers: 
diacycloglycerol (DAG) and inistol 1 ,4,5-triphoisphate (IP3). Increased accumulation of 
IP3 is associated with activation of Gq- and Go-associated receptors. See, generally, 
"Indirect Mechanisms of Synaptic Transmission," Chpt. 8, From Neuron To Brain (3 rd 

10 Hd.) Nichols, J.G. et al eds. Sinauer Associates, Inc. (1992). Assays that detect IP3 
accumulation can be utilized to determine if a candidate compound is, e.g., an inverse 
agonist to a Gq- or Go-associated receptor (i.e., such a compound would decrease the 
levels of IP3). Gq-associated receptors can also been examined using an API reporter 
assay in that Gq-dependent phospholipase C causes activation of genes containing API 

15 elements; thus, activated Gq-associated receptors will evidence an increase in the 
expression of such genes, whereby inverse agonists thereto will evidence a decrease in 
such expression, and agonists will evidence an increase in such expression. 
Commercially available assays for such detection are available. 
3. GPCR Fusion Protein 

20 The use of an endogenous, constitutively activate orphan GPCR or a non- 

endogenous, constitutively activated orphan GPCR, for use in screening of candidate 
compounds for the direct identification of inverse agonists, agonists and partial agonists 
provide an interesting screening challenge in that, by definition, the receptor is active 
even in the absence of an endogenous ligand bound thereto. Thus, in order to 

25 differentiate between, e.g., the non-endogenous receptor in the presence of a candidate 
compound and the non-endogenous receptor in the absence of that compound, with an 
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aim of such a differentiation to allow for an understanding as to whether such compound 
may be an inverse agonist, agonist, partial agonist or have no affect on such a receptor, it 
is preferred that an approach be utilized that can enhance such differentiation. A 
preferred approach is the use of a GPCR Fusion Protein. 

Generally, once it is determined that a non-endogenous orphan GPCR has been 
constitutively activated using the assay techniques set forth above (as well as others), it 
is possible to determine the predominant G protein that couples with the endogenous 
GPCR. Coupling of the G protein to the GPCR provides a signaling pathway that can be 
assessed. Because it is most preferred that screening take place by use of a mammalian 
expression system, such a system will be expected to have endogenous G protein therein. 
Thus, by definition, in such a system, the non-endogenous, constitutively activated 
orphan GPCR will continuously signal. In this regard, it is preferred that this signal be 
enhanced such that in the presence of, e.g., an inverse agonist to the receptor, it is more 
likely that it will be able to more readily differentiate, particularly in the context of 
screening, between the receptor when it is contacted with the inverse agonist. 

The GPCR Fusion Protein is intended to enhance the efficacy of G protein 
coupling with the non-endogenous GPCR. The GPCR Fusion Protein is preferred for 
screening with a non-endogenous, constitutively activated GPCR because such an 
approach increases the signal that is most preferably utilized in such screening 
techniques. This is important in facilitating a significant "signal to noise" ratio; such a 
significant ratio is import preferred for the screening of candidate compounds as 
disclosed herein. 

The construction of a construct useful for expression of a GPCR Fusion Protein 
is within the purview of those having ordinary skill in the art. Commercially available 
expression vectors and systems offer a variety of approaches that can fit the particular 

V) 
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needs of an investigator. Hie criteria of importance for such a GPCR Fusion Protein 
construct is that the endogenous GPCR sequence and the G protein sequence both be in- 
frame (preferably, the sequence for the endogenous GPCR is upstream of the G protein 
sequence) and that the "stop" codon of the GPCR must be deleted or replaced such that 
5 upon expression of the GPCR, the G protein can also be expressed. The GPCR can be 
linked directly to the G protein, or there can be spacer residues between the two 
(preferably, no more than about 12, although this number can be readily ascertained by 
one of ordinary skill in the art). We have a preference (based upon convenience) of use 
of a spacer in that some restriction sites that are not used will, effectively, upon 

10 expression, become a spacer. Most preferably, the G protein that couples to the non- 
endogenous GPCR will have been identified prior to the creation of the GPCR Fusion 
Protein construct. Because there are only a few G proteins that have been identified, it is 
preferred that a construct comprising the sequence of the G protein (i.e., a universal G 
protein construct) be available for insertion of an endogenous GPCR sequence therein; 

15 this provides for efficiency in the context of large-scale screening of a variety of 
different endogenous GPCRs having different sequences. 

As noted above, constitutively activated GPCRs that couple to Gi, Gz and Go are 
expected to inhibit the formation of cAMP making assays based upon these types of 
GPCRs challenging (i.e., the cAMP signal decreases upon activation thus making the 

20 direct identification of, e.g, inverse agonists (which would further decrease this signal), 
interesting. As will be disclosed herein, we have ascertained that for these types of 
receptors, it is possible to create a GPCR Fusion Protein that is not based upon the 
endogenous GPCR's endogenous G protein, in an effort to establish a viable cyclase- 
based assay. Thus, for example, an endogenous Gi coupled receptor can be fused to a Gs 

25 protein - we believe that such a fusion construct, upon expression, "drives" or "forces" 
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the endogenous GPCR to couple with, e.g., Gs rather than the "natural" Gi protein, such 
that a cyclase-based assay can be established. Thus, for Gi, Gz and Go coupled 
receptors, we prefer that that when a GPCR Fusion Protein is used and the assay is based 
upon detection of adenylyl cyclase activity, that the fusion construct be established with 
5 Gs (or an equivalent G protein that stimulates the formation of the enzyme adenylyl 
cyclase). 

Equally effective is a G Protein Fusion construct that utilizes a Gq Protein fused 
with a Gs, Gi, Gz or Go Protein. A most preferred fusion construct can be accomplished 
with a Gq Protein wherein the first six (6) amino acids of the G-protein a-subunit 

10 ("Gaq") is deleted and the last five (5) amino acids at the C-teiminal end of Gaq is 
replaced with the corresponding amino acids of the Ga of the G protein of interest. For 
example, a fusion construct can have a Gq (6 amino acid deletion) fused with a Gi 
Protein, resulting in a "Gq/Gi Fusion Construct". We believe that this fusion construct 
will force the endogenous Gi coupled receptor to couple to its non-endogenous G 

15 protein, Gq, such that the second messenger, for example, inositol triphosphate or 
diacylgycerol, can be measured in lieu of cAMP production. 

4. Co-transfection of a Target Gi Coupled GPCR with a Signal- 
Enhancer Gs Coupled GPCR (cAMP Based Assays) 

20 

A Gi coupled receptor is known to inhibit adenylyl cyclase, and, therefore, 
decrease the level of cAMP production, which can make assessment of cAMP levels 
challenging. An effective technique in measuring the decrease in production of cAMP as 
an indication of constitutive activation of a receptor that predominantly couples Gi upon 
25 activation can be accomplished by co-transfecting a signal enhancer, e.g., a non- 
endogenous, constitutively activated receptor that predominantly couples with Gs upon 
activation (e.g., TSHR-A623I, disclosed below), with the Gi linked GPCR. As is 
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apparent, constitutive activation of a Gs coupled receptor can be determined based upon 
an increase in production of cAMP. Constitutive activation of a Gi coupled receptor 
leads to a decrease in production cAMP. Thus, the co-transfection approach is intended 
vo advantageously exploit these "opposite" affects. For example, co-transfection of a 
5 non-endogenous, constitutively activated Gs coupled receptor (the "signal enhancer") 
with the endogenous Gi coupled receptor (the "target receptor") provides a baseline 
cAMP signal (i.e., although the Gi coupled receptor will decrease cAMP levels, this 
"decrease" will be relative to the substantial increase in cAMP levels established by 
constitutively activated Gs coupled signal enhancer). By then co-trans fecting the signal 

1 0 enhancer with a constitutively activated version of the target receptor, cAMP would be 
expected to further decrease (relative to base line) due to the increased functional activity 
of the Gi target (/.<?., which decreases cAMP). 

Screening of candidate compounds using a cAMP based assay can then be 
accomplished, with two provisos: first, relative to the Gi coupled target receptor, 

15 "opposite" effects will result, i.e., an inverse agonist of the Gi coupled target receptor 
will increase the measured cAMP signal, while an agonist of the Gi coupled target 
receptor will decrease this signal; second, as would be apparent, candidate compounds 
that are directly identified using this approach should be assessed independently to 
ensure that these do not target the signal enhancing receptor (this can be done prior to or 

20 after screening against the co-transfected receptors). 

F. Medicinal Chemistry 

Generally, but not always, direct identification of candidate compounds is 
preferably conducted in conjunction with compounds generated via combinatorial 
25 chemistry techniques, whereby thousands of compounds are randomly prepared for 
such analysis. Generally, the results of such screening will be compounds having 
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unique core structures; thereafter, these compounds are preferably subjected to 
additional chemical modification around a preferred core structure(s) to further 
enhance the medicinal properties thereof Such techniques are known to those in the 
art and will not be addressed in detail in this patent document. 

G. Pharmaceutical compositions 

Candidate compounds selected for further development can be formulated into 
pharmaceutical compositions using techniques well known to those in the art. Suitable 
pharmaceutically-acceptable carriers are available to those in the art; for example, see 
Remington's Pharmaceutical Sciences, 16 lh Edition, 1980, Mack Publishing Co., (Oslo 
et ah, eds.). 

H. Other Utility 

Although a preferred use of the non-endogenous versions the human GPCRs 
disclosed herein may be for the direct identification of candidate compounds as inverse 
agonists, agonists or partial agonists (preferably for use as pharmaceutical agents), these 
versions of human GPCRs can also be utilized in research settings. For example, in vitro 
and in vivo systems incorporating GPCRs can be utilized to further elucidate and 
understand the roles these receptors play in the human condition, both normal and 
diseased, as well as understanding the role of constitutive activation as it applies to 
understanding the signaling cascade. The value in non-endogenous human GPCRs is 
that their utility as a research tool is enhanced in that, because of their unique features, 
non-endogenous human GPCRs can be used to understand the role of these receptors in 
the human body before the endogenous ligand therefore is identified. Other uses of the 
disclosed receptors will become apparent to those in the art based upon, inter alia, a 
review of this patent document. 
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EXAMPLES 

The following examples are presented for purposes of elucidation, and not 
limitation, of the present invention. While specific nucleic acid and amino acid 
sequences are disclosed herein, those of ordinary skill in the art are credited with the 
5 ability to make minor modifications to these sequences while achieving the same or 
substantially similar results reported below. The traditional approach to application or 
understanding of sequence cassettes from one sequence to another (e.g. from rat receptor 
to human receptor or from human receptor A to human receptor B) is generally 
predicated upon sequence alignment techniques whereby the sequences are aligned in an 

10 effort to determine areas of commonality. The mutational approach disclosed herein 
does not rely upon this approach but is instead based upon an algorithmic approach and a 
positional distance from a conserved proline residue located within the TM6 region of 
human GPCRs. Once this approach is secured, those in the art are credited with the 
ability to make minor modifications thereto to achieve substantially the same results (i.e., 

15 constitutive activation) disclosed herein. Such modified approaches are considered 
within the purview of this disclosure. 
// 
// 
// 

20 // 

Example 1 

Endogenous Human Gpcrs 

1. Identification of Human GPCRs 

The disclosed endogenous human GPCRs were identified based upon a review 
25 of the GenBank™ database information. While searching the database, the following 
cDNA clones were identified as evidenced below (Table C). 

24 
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TABLE C 



| Disclosed 
' Human 
Orphan 
CPCRs 


Accession 
Number 
Identified 


Complete DNA 
Sequence 
(Base Pairs) 


Open Reading 
Frame 
(Base Pairs) 


Nucleic 
Acid 

SEQJD. 
NO. 


Amino 
Acid 
SEQ.ID. 
NO. 


HRIIP8 


AL121755 


147,566bp 


l,152bp 


1 


2 


HRIJP9 


ACO 113375 


143,181bp 


l,260bp 


3 


4 


hRUPIO 


AC008745 


94,194bp 


l,014bp 


5 


(> 


hRUPll 


AC013396 


155,086bp 


l,272bp 


7 


8 


HRIJP12 


AP00O8O8 


177,764bp 


966bp 


9 


10 


hRUP13 


ACO 11780 


167,8 19bp 


l,356bp 


1 1 


12 


hRUP14 


AL137118 


168,297bp 


1,04 lbp 


13 


14 


hRlJPIS 


ALO 16468 


138,828bp 


l,527bp 


15 


16 


HRUP16 


AL136106 


208,042bp 


l,068bp 


17 


18 


HRUP17 


AC023078 


161,735bp 


969bp 


19 


20 


hRUP18 


AC008547 


117,304bp 


l,305bp 


21 


22 


HRIFP19 


AC026331 


145,183bp 


1,04 lbp 


23 


24 


KRUP20 


AL361458 


163,51 lbp 


1,01 lbp 


25 


26 


hRUP21 


AC026756 


156,534bp 


l,014bp 


27 


28 


hRLJP22 


AC027026 


151,81 lbp 


993bp 


29 


30 


hRUP23 


AC007104 


200,000bp 


l,092bp 


31 


32 


hRUP24 


AL355388 


190,538bp 


l,125bp 


33 


34 


hRUP25 


AC02633 1 


145,183bp 


l,092bp 


35 


36 


hRUP26 


AC023040 


178,508bp 


1 ,044bp 


37 


38 


HRUP27 


AC027643 


158,700bp 


1 ,020bp 


39 


40 I 



2. Full Length Cloning 
a. hRUP8 (Seq. Id. Nos. 1 & 2) 
The disclosed human RUP8 was identified based upon the use of EST database 
(dbEST) information. While searching the dbEST, a cDNA clone with accession number 
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AL121755 was identified to encode a novel GPCR. The following PCR primers were 
used for RT-PCR with human testis Marathon-Ready cDNA (Clontech) as templates: 
5 '-CTTGCAG ACATCACCATGGCAGCC-3 ' (SEQ.ID. NO. :41 ; sense) and 
5 -GTG ATGCTCTGAGTACTGG ACTGG-3 1 (SEQ.ID.NO.: 42; antisense). 
PCR was performed using Advantage cDNA polymerase (Clontech; manufacturing 
instructions will be followed) in 50ul reaction by the following cycles: 94°C for 30 sec; 
94°C for 10 sec; 65°C for 20 sec, 72°C for 1.5 min, and 72°C for 7 min. Cycles 2 
through 4 were repeated 35 times. 

A 1.2kb PCR fragment was isolated and cloned into the pCRII-TOPO vector 
(Invitrogen) and sequenced using the ABI Big Dye Terminator kit (P.E. Biosystem). 
See, SEQ.ID.NO.: 1. The putative amino acid sequence for RUP8 is set forth in 
SEQ.ID.NO.:2. 

b. hRUP9 (Seq. Id. Nos. 3 & 4) 
The disclosed human RUP9 was identified based upon the use of GeneBank 
database information. While searching the database, a cDNA clone with Accession 
Number AC01 1375 was identified as a human genomic sequence from chromosome 
5. The full length RUP9 was cloned by PCR using primers: 
5 '-G A AGCTGTG AAG AGTG ATGC-3 ' (SEQ.ID.NO. :43; sense), 
5 '-GTC AGCAATATTGATA AGC AGC AG-3 ' (SEQ.ID.NO.:44; antisense) 
and human genomic DNA (Promega) as a template. Taq Plus Precision polymerase 
(Stratagene) was used for the amplification in a 100(il reaction with 5% DMSO by the 
following cycle with step 2 to step 4 repeated 35 times: 94°C for 1 minute; 94°C for 
30 seconds; 56°C for 30 seconds; 72°C for 2 minutes; 72°C for 5 minutes. 

A 13 Kb PCR fragment was isolated and cloned into the pCRII-TOPO vector 
(Invitrogen) from 1% agarose gel and completely sequenced using the ABI Big Dye 
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Terminator kit (P.F. Biosystem). Sec, SEQ. ID. NO. :3. The putative amino acid 
sequence for RIIPS is set forth in SEQTD.NO.:4. The sequence of RLJP9 clones isolated 
from human genomic DNA matched with the sequence obtained from data base. 

c. hRUPIO (Seq. Id. Nos. 5 & 6) 

5 The disclosed human RUP10 was identified based upon the use of GenBank 

database information. While searching the database, a cDNA clone with accession 
number AC008754 was identified as a human genomic sequence from chromosome 
19. The full length RUP 10 was cloned by RT-PCR using primers: 
.V-CCATGGGGAACGATICTGTCAGCTACG^r (SEQ.ID.NO.:45; sense) and 

1 0 5 -GCTATGCCTG AAGCC AGTCTTGTG-3 ' (SEQ.ID.NO.:46; antisense) 

and human leukocyte Marathon-Ready cDNA (Clontech) as a template. Advantage 
cDNA polymerase (Clontech) was used for the amplification in a 50jal reaction by the 
following cycle with step 2 to step 4 repeated 35 times: 94°C for 30 seconds; 94°C 
for 1 0 seconds; 62°C for 20 seconds; 72°C for 1 .5 minutes; 72°C for 7 minutes. A 1 .0 

15 Kb PGR fragment was isolated and cloned into the pCRII-TOPO vector (Invitrogen) 
and completely sequenced using the AJBI Big Dye Terminator kit (P.E. Biosystem). 
The nucleic acid sequence of the novel human receptor RUP 10 is set forth in 
SEQ.ID.NO.:5 and the putative amino acid sequence thereof is set forth in 
SEQ.ID.NO.:6. 

20 

d. hRUPl 1 (Seq. Id. Nos. 7 & 8) 

The disclosed human RUP11 was identified based upon the use of GenBank 
database information. While searching the database, a cDNA clone with accession 
25 number AC013396 was identified as a human genomic sequence from chromosome 2. 

in 
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The full length RUP1 1 was cloned by PCR using primers: 
5 '-CCAGGATG TTGTGTCACCGTGGTGGC-3 1 (SEQ.ID.NO.:47; sense), 
5 , -CACAGCGCTGCAGCCCTGCAGCTGGC-3 , (SEQ.ID.NO.:48; antisense) 
and human genomic DNA (Clontech) as a template. TaqPlus Precision DNA 
5 polymerase (Stratagene) was used for the amplification in a 50jil reaction by the 
following cycle with step 2 to step 4 repeated 35 times: 94°C for 3 minutes; 94°C for 20 
seconds; 67°C for 20 seconds; 72°C for 1.5 minutes; 72°C for 7 minutes. A 1 .3 Kb PCR 
fragment was isolated and cloned into the pCRII-TOPO vector (Invitrogen) and 
completely sequenced using the ABI Big Dye Terminator kit (P.E. Biosystem). The 

10 nucleic acid sequence of the novel human receptor RUP1 1 is set forth in SEQ.ID.NO.:7 
and the putative amino acid sequence thereof is set forth in SEQ.ID.NO,:8. 
e. hRUP12(Seq. Id. Nos. 9 & 10) 
The disclosed human RUP12 was identified based upon the use of GenBank 
database. While searching the database, a cDNA clone with accession number 

15 AP0008O8 was identified to encode a new GPCR, having significant homology with rat 
RTA and human masl oncogene GPCRs. The full length RUP12 was cloned by PCR 
using primers: 

5 '-CTTCCTCTCGTAGGGATGAACCAGAC-3 ' (SEQ,ID.NO.:49; sense) 
5'-CTCGCACAGGTGGGAAGCACCTGTGG-3 1 (SEQ.ID.NO.:50; antisense) 
20 and human genomic DNA (Clontech) as template. TaqPlus Precision DNA polymerase 
(Stratagene) was used for the amplification by the following cycle with step 2 to step 4 
repeated 35 times: 94°C for 3 min; 94°C for 20 sec; 65°C for 20sec; 72°C for 2 min and 
72°C for 7 min. A l.Okb PCR fragment was isolated and cloned into the pCRII-TOPO 
vector (invitrogen) and completely sequenced using the ABI Big Dye Terminator kit 
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(P.E. Biosystem) (sec. SE:Q.ID.NO.:9 for nucleic acid sequence and SEQ.ID.NO.: 10 for 
deduced amino acid sequence). 

f. hRUP13 (Seq. Id. Nos. 1 1 & 12) 

The disclosed human RUP13 was identified based upon the use of 
5 GenBank database. While searching the database, a cDNA clone with accession number 
AC01 1780 was identified to encode a new GPCR, having significant homology with 
GPCR fish GPRX-ORYLA. The full length RUP13 was cloned by PGR using primers: 
5 -GCCTGTG ACAGG AGGTACCC1 GG-3 1 (SEQ.ID.NO. :5 1 ; sense) 
S -CATATCCCTCCGAGI G I CCAGCGGC 3 1 (SEQ.ID.NO. :52; antisense) 

10 and human genomic DNA (Clontech) as template. TaqPlus Precision DNA polymerase 
(Stratagene) was used for the amplification by the following cycle with step 2 to step 4 
repeated 35 times: 94°C for 3 min; 94°C for 20 sec; 65°C for 20sec; 72°C for 2 min and 
72°C for 7 min. A 1.35kb PCR fragment was isolated and cloned into the pCRII-TOPO 
vector (Invitrogen) and completely sequenced using the ABI Big Dye Terminator kit 

15 (P.E. Biosystem) (see, SEQ.ID.NO.: 1 1 for nucleic acid sequence and SEQ.ID.NO.:12 
for deduced amino acid sequence). 

g. hRUP14(Seq. Id. Nos. 13 & 14) 

The disclosed human RUP14 was identified based upon the use of GeneBank 
database information. While searching the database, a cDNA clone with Accession 

20 Number AL1371 18 was identified as a human genomic sequence from chromosome 
13. The full length RUP14 was cloned by PCR using primers: 
5'-GCATGGAGAGAAAAlTI ATGTCCTTGCAACC-3' (SEQ.ID.NO.:53; sense) 
S'-CAAGAACAGGTCTCATCTAAGAGCTCC-S 1 (SEQ.ID.NO. :54; antisense) 
and human genomic DNA (Promega) as a template. Taq Plus Precision polymerase 

25 (Stratagene) and 5% DMSO were used for the amplification by the following cycle 
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with step 2 and step 3 repeated 35 times: 94°C for 3 minute; 94°C for 20 seconds; 
58°C for 2 minutes; 72°C for 10 minutes. 

A 1.1 Kb PCR fragment was isolated and cloned into the pCRII-TOPO vector 
(Invitrogen) and completely sequenced using the ABI Big Dye Terminator kit (P.E. 
5 Biosystem) (see, SEQ.ID.NO.: 13 for nucleic acid sequence and SEQ.ID.NO.: 14 for 
deduced amino acid sequence). The sequence of RUP14 clones isolated from human 
genomic DNA matched with the sequence obtained from database. 

h. hRUP15(Seq. Id.Nos. 15& 16) 

The disclosed human RUP15 was identified based upon the use of GeneBank 
1 0 database information. While searching the database, a cDNA clone with Accession 
Number AC016468 was identified as a human genomic sequence. The full length 
RUP1 5 was cloned by PCR using primers: 

S'-GCTGTrGCCATGACGTCCACCTGCACO' (SEQ.lD.NO.:55; sense) 
5 '-GGACAGTrCAAGGTrTGCCTTAGAAC-O' (SEQ.ID.NO.:56; antisense) 

1 5 and human genomic DNA (Promega) as a template. Taq Plus Precision polymerase 
(Stratagene) was used for the amplification by the following cycle with step 2 to 4 
repeated 35 times: 94°C for 3 minute; 94°C for 20 seconds; 65°C for 20 seconds; 
72°C for 2 minutes and 72°C for 7 minutes. 

A 1 .5 Kb PCR fragment was isolated and cloned into the pCRH-TOPO vector 

20 (Invitrogen) and completely sequenced using the ABI Big Dye Terminator kit (P.E. 
Biosystem). See, SEQ.ID.NO,: 15 for nucleic acid sequence and SEQ.ID.NO.: 16 for 
deduced amino acid sequence. The sequence of RUP15 clones isolated from human 
genomic DNA matched with the sequence obtained from database. 

i. hRUPI6 (Seq. Id. Nos. 17 & 18) 
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The disclosed human RUP16 was identified based upon the use of GeneBank 
database information. While searching the database, a cDNA clone with Accession 
Number AL 136 106 was identified as a human genomic sequence from chromosome 
13. The full length RUP16 was cloned by PCR using primers: 
5 S'-CnTCGATACTGCTCCl ATGCTC-3' (SEQJD.NO.:57; sense, 5' of initiation codon), 

5 '-GTAGTCC ACTG AAAGTCC AGTG ATCC-3 1 (SEQ.ID.NO.:58; antisense, 3 1 of stop codon) 
and human skeletal muscle Marathon-Ready cDNA (Clontech) as template. Advantage 
cDNA polymerase (Clontech) was used for the amplification in a 50ul reaction by the 
following cycle with step 2 to 4 repeated 35 times: 94°C for 30 seconds; 94°C for 5 
10 seconds; 69°C for 15 seconds; 72°C for 1 minute and 72°C for 5 minutes. 

A 1.1 Kb PCR fragment was isolated and cloned into the pCRll-TOPO vector 
(Invitrogen) and completely sequenced using the T7 sequenase kit (Amsham). See, 
SEQ.ID.NO.:17 for nucleic acid sequence and SEQ.lD.NO.:l 8 for deduced amino acid 
sequence. The sequence of RUP16 clones matched with four unordered segments of 
15 AL136106, indicating that the RUP16 cDNA is composed of 4 exons. 
j. hRUP17 (Seq. Id. Nos. 19 & 20) 
The disclosed human RUP17 was identified based upon the use of GeneBank 
database information. While searching the database, a cDNA clone with Accession 
Number AC023078 was identified as a human genomic sequence from chromosome 
20 11. The full length RUP 1 7 was cloned by PCR using primers: 

5 '-TTTCTG AGC ATGG ATCC AACC ATCTC-3 * (SEQ.ID.NO.:59; sense, containing initiation 
codon) 

5 '-CTGTCTGACAGGGCAG AGGCTCTTC-3 1 (SEQ.ID.NO.:60; antisense, 3' of stop codon) 
and human genomic UNA (Promega) as template. Advantage cDNA polymerase mix 
25 (Clontech) was used for the amplification in a lOOul reaction with 5% DMSO by the 
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following cycle with step 2 to 4 repeated 30 times: 94°C for 1 min; 94°C for 1 5 sec; 
67°C for 20 sec; 72°C for 1 min and 30 sec; and 72°C for 5 min. 

A 970bp PCR fragment was isolated from 1% agarose gel and cloned into the 
pCRII-TOPO vector (Invitrogen) and completely sequenced using the ABI Big Dye 
5 Termiantor Kit (P.E. Biosystem). See, SEQ.ID.NO.: 1 9 for nucleic acid sequence and 
SEQ.1D.NO.:20 for deduced amino acid sequence. 

k. hRUP18(Seq. Id. Nos. 21 & 22) 
The disclosed human RUP18 was identified based upon the use of GeneBank 
database information. While searching the database, a cDNA clone with Accession 
10 Number AC008547 was identified as a human genomic sequence from chromosome 
5. The full length RUP18 was cloned by PCR using primers: 
5 '-GG AACTCGTATAGACCCAGCGTCGCTCC-3 ' (SEQ.ID.NO.:61 ; sense. 5' of the 
initiation codon), 

5 , -GGAGGTTGCGCCTTAGCGACAGATGACC-3 , (SEQ.lD.NO.:62; antisense, V of stop 
1 5 codon) 

and human genomic DNA (Promega) as template. TaqPlus precision DNA 
polymerase (Stratagene) was used for the amplification in a lOOul reaction with 5% 
DMSO by the following cycle with step 2 to 4 repeated 35 times: 95°C for 5 mm; 
95°C for 30 sec; 65°C for 30 sec; 72°C for 2 min; and 72°C for 5 mm. 
20 A 1.3kb PCR fragment was isolated from 1% agarose gel and cloned into the 

pCRII-TOPO vector (Invitrogen) and completely sequenced using the ABI Big Dye 
Termiantor Kit (P.E. Biosystem). See, SEQ.ID.NO.:21 for nucleic acid sequence and 
SEQ.ID.NO.:22 for deduced amino acid sequence. 

1. HRUP19 (Seq. Id. Nos. 23 & 24) 
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The disclosed human RUP19 w as identified based upon the use of GeneBank 
database information. While searching the database, a cDNA clone with Accession 
Number AC026331 was identified as a human genomic sequence from chromosome 
12. The full length RUP19 was cloned by PGR using primers: 
5 5 '-CTGCACCCGG ACACTTGCTCTG-3 1 (SEQ.ID.NO.:63; sense, 5' of initiation codon), 

5 '-GTCTGCTTGTTCAGTGCC ACTCA AC-3 ' (SEQ.ID.NO.:64; antisense, containing the stop 
codon) 

and human genomic DNA (Promega) as template. TaqPlus Precision DNA 
polymerase (Stratagene) was used for the amplification with 5% DMSO by the 

10 following cycle with step 2 to 4 repeated 35 times: 94°C for 1 min; 94°C for 15 sec; 
70°C for 20 sec; 72°C for 1 mm and 30 sec; and 72°C for 5 min. 

A 1.1 kp PCR fragment was isolated from 1% agarose gel and cloned into the 
pCRII-TOPO vector (Invitrogen) and completely sequenced using the ABI Big Dye 
Termiantor Kit (P.E. Biosystem). See, SEQ.ID.NO.:23 for nucleic acid sequence and 

1 5 SEQ.ED.NO/.24 for deduced ammo acid sequence. 

m. hRUP20 (Seq. Id. Nos. 25 & 26) 
The disclosed human RUP20 was identified based upon the use of GeneBank 
database information. While searching the database, a cDNA clone with Accession 
Number AL1 61458 was identified as a human genomic sequence from chromosome 

20 1 . The full length RUP20 was cloned by PCR using primers: 

5'-TATCTGCAATTCTATTCTAGCTCCTG-3 1 (SEQ.ID.NO.:65; sense, 5" of initiation codon), 
5'-TGTCCCTAATAAAGTCACATGAATGC-3 1 (SEQ.ID.NO.:66; antisense, 3 1 of stop codon) 
and human genomic DNA (Promega) as template. Advantage cDNA polymerase mix 
(Clonetech) was used for the amplification with 5% DMSO by the following cycle with 
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step 2 to 4 repeated 35 times: 94°C for 1 min; 94°C for 1 5 sec; 60°C for 20 sec; 72°C 
for 1 min and 30 sec; and 72°C for 5 mm. 

A 1.0 kp PCR fragment was isolated from 1% agarose gel and cloned into the 
pCRIl-TOPO vector (Invitrogen) and completely sequenced using the ABI Big Dye 
Termiantor Kit (P.E. Biosystem). See, SEQ.lD.NO.:25 for nucleic acid sequence and 
SEQ.ID.NO.:26 for deduced amino acid sequence. 

n. hRUP21 (Seq. Id. Nos. 27 & 28) 

The disclosed human RUP21 was identified based upon the use of GeneBank 
database information. While searching the database, a cDNA clone with Accession 
Number AC026756 was identified as a human genomic sequence from chromosome 
1 3. The full length RUP21 was cloned by PCR using primers: 
5'- GGAGACAACCATGAATGAGCCAC -3' (SEQ.ID.NO.:67; sense) 
5 1 - TATTTCAAGGGTTGTTTGAGTAAC -3' (SEQ.ID.NO.:68; antisense) 
and human genomic DNA (Promega) as template. Taq Plus Precision polymerase 
(Stratagene) was used for the amplification in a lOOul reaction with 5% DMSO by the 
following cycle with step 2 to 4 repeated 30 times: 94°C for 1 min; 94°C for 15 sec; 
55°C for 20 sec; 72°C for 1 min and 30 sec; and 72°C for 5 min, 

A 1 ,014 bp PCR fragment was isolated from 1% agarose gel and cloned into the 
pCRII-TOPO vector (Invitrogen) and completely sequenced using the ABI Big Dye 
Termiantor Kit (P.E. Biosystem). See, SEQ.ED.NO.:27 for nucleic acid sequence and 
SEQ.ID.NO.:28 for deduced amino acid sequence. 

o. hRUP22 (Seq. Id. Nos. 29 & 30) 

The disclosed human RUP22 was identified based upon the use of GeneBank 
database information. While searching the database, a cDNA clone with Accession 

34 
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Number AC027026 was identified as a human genomic sequence from chromosome 
1 1. The full length RUP22 was cloned by PCR using primers: 
5- GGCACCAGTGGAGGTTTTCTGAGCATG -3 1 (SEQ.ID.NO.:69; sense, containing 
initiation codon) 

5 5 , -CTGATGGAAG TAGAGGCTGTCCATCTC-3' (SEQ.ID.NO.:70; antisensc, 3' of stop 
codon) 

and human genomic DNA (Promega) as template. TaqPlus Precision DNA polymerase 
(Stratagene) was used for the amplification in a lOOul reaction with 5% DMSO by the 
following cycle with step 2 to 4 repeated 30 times: 94 U C, 1 minutes 94 U C, 15 seconds 
1 0 55°C, 20 seconds 72°C, 1 .5 minute 72°C, 5 minutes. 

A 970bp PCR fragment was isolated from 1% agarose gel and cloned into the 
pCRII-TOPO vector (Invitrogen) and completely sequenced using the ABI Big Dye 
Termiantor Kit (P.E. Biosystem). See, SEQ.ID.NO.:29 for nucleic acid sequence and 
SEQ.ID.NO.:30 for deduced amino acid sequence. 
1 5 p. HRUP23 (Seq. Id. Nos. 31 & 32) 

The disclosed human RUP23 was identified based upon the use of GeneBank 
database information. While searching the database, a cDNA clone with Accession 
Number AC007104 was identified as a human genomic sequence from chromosome 

4. The full length RUP23 was cloned by PCR using primers: 

20 5'-CCTGGCGAGCCGCrAGCGCCATG-3' (SEQ.ID.NO.:71 ; sense, ATG as the initiation 
codon), 

5 , -ATGAGCCCTGCCAGGCCCTCAGT-3 , (SEQ.ID.NO.:72; antisense, TCA as the stop 
codon) 

and human placenta Marathon-Ready cDNA (Clontech) as templaie. Advantage cDNA 
25 polymerase (Clontech) was used for the amplification in a 50ul reaction by the following 
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cycle with step 2 to 4 repeated 35 times: 95°C for 30 sec; 95°C for 15 sec; 66°C for 20 
sec; 72°C for 1 min and 20 sec; and 72°C for 5 min. 

A 1.0 kb PCR fragment was isolated and cloned into the pCRII-TOPO vector 
(Invitrogen) and completely sequenced using the ABI Big Dye Terminator Kit (P.E. 
5 Biosystem). See, SEQ.ID,NO.:31 for nucleic acid sequence and SEQ.ID.NO.:32 for 
deduced amino acid sequence. 

q. hRUP24 (Seq. Id. Nos. 33 & 34) 
The disclosed human RUP25 was identified based upon the use of GeneBank 
database information. While searching the database, a cDNA clone with Accession 
10 Number AC026331 was identified as a human genomic sequence from chromosome 
12. The full length RUP25 was cloned by PCR using primers: 
5 '-GCTGGAGCATTC ACTAGGCGAG-3 ' (SEQ.ID.NO.:73; sense, 5'of initiation codon), 
5 AG ATCCTGGTTCTTGGTGACAATG-3 ' (SEQ.ED.NO.:74; antisense, 3' of stop codon) 
and human genomic DNA (Promega) as template. Advantage cDNA polymerase mix 
15 (Clontech) was used for the amplification with 5% DMSO by the following cycle with 
step 2 to 4 repeated 35 times: 94°C for 1 minute; 94°C for 15 seconds; 56°C for 20 
seconds 72°C for 1 minute 30 seconds and 72°C for 5 minutes. 

A 1.2kb PCR fragment was isolated from 1% agarose gel and cloned into the 
pCRII-TOPO vector (Invitrogen) and completely sequenced using the ABI Big Dye 
20 Termiantor Kit (P.E, Biosystem). See, SEQ.TD.NO.:33 for nucleic acid sequence and 
SEQ.ID.NO.:34 for deduced amino acid sequence. 

r. hRUP25 (Seq. Id. Nos. 35 & 36) 
The disclosed human RUP25 was identified based upon the use of GeneBank 
database information. While searching the database, a cDNA clone with Accession 
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Number AC026331 was identified as a human genomic sequence from chromosome 
12. The lull length RUP25 was cloned by PCR using primers: 

5-GCTGGAGCA7TCAC1 AGGCGAG-.V (SHQ ID.NO. 75; sense, 5 of initiation codon), 
5-AGATCCTGGTTCTTGGTGACAATG-3 1 (SEQ.ID.NO.:76; antisense, 3 1 of stop codon) 
5 and human genomic DNA (Promega) as template. Advantage cDNA polymerase mix 
(Clontech) was used for the amplification with 5% DMSO by the following cycle with 
step 2 to 4 repeated 35 times: 94°C for 1 minute; 94°C for 15 seconds; 56°C for 20 
seconds 72°C for 1 minute 30 seconds and 72°C for 5 minutes. 

A 1.2kb PCR fragment was isolated from 1% agarose gel and cloned into the 
10 pCRII-TOPO vector (Invitrogen) and completely sequenced using the ABI Big Dye 
Termiantor Kit (P.E. Biosystem). See, SEQ.ID.NO.:35 for nucleic acid sequence and 
SEQ.ID.NO.:36 for deduced amino acid sequence. 

s. hRUP26 (Seq. Id. Nos. 37 & 38) 
The disclosed human RUP26 was identified based upon the use of GeneBank 
15 database information. While searching the database, a cDNA clone with Accession 
Number AC023040 was identified as a human genomic sequence from chromosome 
2. The full length RUP26 was cloned by RT-PCR using RUP26 specific primers: 
5 AGCCATCCCTGCCAGGAAGC ATG G-3 , (SEQ.ID.NO.:77; sense, containing initiation 
codon) 

20 5 -CC AGAC1 GTGG ACTCAAGAACT CT A GG-3 1 (SEQ.ID.NO.:78; antisense, containing stop 
codon) 

and human pancreas Marathon - Ready cDNA (Clontech) as template. Advantage cDNA 
polymerase mix (Clontech) was used for the amplification in a lOOjil reaction with 5% 
DMSO by the following cycle with step 2 to 4 lepeated 35 times: 94 n C for 5 minute; 
25 95°C for 30 seconds; 65°C for 30 seconds 72°C for 2 minute and 72°C for 5 minutes. 

3^ 
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A 1.1 kb PCR fragment was isolated from 1% agarose gel and cloned into the 
pCRIl-TOPO vector (Invitrogen) and completely sequenced using the ABI Big Dye 
Termiantor Kit (P.E. Biosystem). See, SEQ.ID.NO.:37 for nucleic acid sequence and 
SEQ.ID.NO.:38 for deduced amino acid sequence. 
5 t. hRUP27 (Seq. Id. Nos. 39 & 40) 

The disclosed human RUP27 was identified based upon the use of GeneBank 
database information. While searching the database, a cDNA clone with Accession 
Number AC027643 was identified as a human genomic sequence from chromosome 
12. The full length RUP27 was cloned by PCR using RUP27 specific primers: 
10 5 '-AGTCCACGAAC AATGAATCCATTTCATG-3 ' (SEQ.HXNO.:79; sense, containing 
initiation codon), 

5 '-ATCATGTCTAGACTCATGGTGATCC-3 ' (SEQ.1D.NO.:80; antisense, 3 1 of stop codon) 
and the human adult brain Marathon-Ready cDNA (Clontech) as template. Advantage cDNA 
polymerase mix (Clontech) was used for the amplification in a 50\x\ reaction with 5% 
15 DMSO by the following cycle with step 2 to 4 repeated 35 times: 94°C for 1 minute; 
94°C for 10 seconds; 58°C for 20 seconds 72°C for 1 minute 30 seconds and 72°C for 5 
minutes. 

A 1.1 kb PCR fragment was isolated from 1% agarose gel and cloned into the 
pCRII-TOPO vector (Invitrogen) and completely sequenced using the ABI Big Dye 
20 Termiantor Kit (P.E. Biosystem). See, SEQ.ID.NO.:35 for nucleic acid sequence and 
SEQ.ID.NO.:36 for deduced amino acid sequence. The sequence of RUP27 cDNA 
clone isolated from human brain was determined to match with five unordered segments 
of AC027643, indicating that the RUP27 cDNA is composed of 5 exons. 
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Example 2 

Preparation of Non-Endogfnoi s, Constitl tivflv Activated Gpcrs 

Those skilled in the art are credited with the ability to select techniques for 
mutation of a nucleic acid sequence. Presented below are approaches utilized to 
5 create non-endogenous versions of several of the human GPCRs disclosed above. 
The mutations disclosed below are based upon an algorithmic approach whereby the 
16 th amino acid (located in the TC3 region of the GPCR) from a conserv ed proline (or 
an endogenous, conservative substitution therefore) residue (located in the TM6 
region of the GPCR, near the TM6/IC3 interface) is mutated, preferably to an alanine, 
10 histidine, arginine or lysine amino acid residue, most preferably to a lysine amino acid 
residue. 

1 . Transformer Site-Directed ™ Mutagenesis 

Preparation of non-endogenous human GPCRs may be accomplished on human 
GPCRs using Transformer Site-Directed™ Mutagenesis Kit (Clontech) according to the 
1 5 manufacturer instructions. Two mutagenesis primers are utilized, most preferably a 
lysine mutagenesis oligonucleotide that creates the lysine mutation, and a selection 
marker oligonucleotide. For convenience, the codon mutation to be incorporated into the 
human GPCR is also noted, in standard form (Table D): 



20 TABLE D 



Receptor Identifier 


Codon Mutation 


HRUP8 


V274K 


hRUP9 


T249K 


hRUPlO 


R232K 


hRUPll 


M204K 


HRUP12 


F220K 


HRUP16 


A238K 



3*) 
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hRI PI 7 


Y215K 


hRlIP18 


L294K 


hRlJP19 


T219K 


hRlJP20 


K248A 
K248H 
K248R 


hRI JP21 


R240K 


hRIJP22 


Y222K 


KRUP24 


A245K 


HRUP25 


I230K 


hRUP26 


V285K 


hRliP27 


T248K 



2. QuikChange™ Site-Directed™ Mutagenesis 

Preparation of non-endogenous human GPCRs can also be accomplished by 
5 using QuikChange™ Site-Directed™ Mutagenesis Kit (Stratagene, according to 
manufacturer's instructions). Endogenous GPCR is preferably used as a template and 
two mutagenesis primers utilized, as well as, most preferably, a lysine mutagenesis 
oligonucleotide and a selection marker oligonucleotide (included in kit). For 
convenience, the codon mutation incorporated into the novel human GPCR and the 
10 respective oligonucleotides are noted, in standard form (Table E): 



TABLE E 



Receptor 
Identifier 


Codon 
Mutation 


5*-3' orientation (sense), 
(SEQ.ED.NO.) mutation 
underlined 


5*-3* orientation 
(antisense) (SEQ.ID.NO.) 


Cycle Conditions 
Min ('), Sec (") 
Cycles 2-4 
repeated 1 6 times 


hRUP13 


A268K 


GGGGAG GG A A A GC A A 

AGGTGGTCCTCCTGG 

(81) 


CCAGGAGAACCACCT 

TTGCT1TCCCTCCCC 

(82) 


98° for T 
98° for 30" 
56°C for 30" 
72° for IT 40" 
72° for 5 ' 


hRUP14 


L246K 


CAGGAAGGCAAAGAC 
CACCATCATCATC (85) 


GATGATGATGGTGGT 
CTTTG CCTTCCTG (86) 


98° for V 
98° for 30" 
55°Cfor30" 
72° for IT 40" 
72° for 5' 
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hRUPl> 


A398K 


CCAG'I gca aagctaag 

AAAGTGATCTTC(89) 


GAAGATCACTITCTTA 
G CT TTG C A CTG G (90) 


98° for 2' 
98° for 30" 
55°C for 30" 
72° for 1 1 ' 40' 1 
72° for 5' 


hRUP23 


W275K 


CiCCCiCCAC CGCGCCAA 
GAGGAAGATI GGC (93) 


GCCAA TC ITCCTCTTG 

GCGCGGTGGCGGC 

(94) 


98° for 2' 
98° for 30" 
56°C for 30" 
72° for 1 T 40" 
72° for y 



The non-endogenous human GPCRs were then sequenced and the derived and 
verified nucleic acid and amino acid sequences are listed in the accompanying 
5 "Sequence Listing" appendix to this patent document, as summarized in Table F 
below: 



TABLE F 



Non Endogenous Human 
GPCR 


Nucleic Acid Sequence Listing 


Amino Acid Sequence 
Listing 


hRUP13 


SEQ.TD.NO. 83 


SEQ.ID.NO.:S4 


HRUP14 


SEQ.1D.NO/87 


SEQ.ID.NO.:88 


hRUP15 


SEQ.ID.NO. 91 


SEQ.ID.NO.:92 


hRUP23 


SEQ.ID.NO.:95 


SEQ.TD.NO. :96 



Example 3 
1 0 Receptor Expression 

Although a variety of cells are available to the art for the expression of 
proteins, it is most preferred that mammalian cells be utilized. The primary reason for 
this is predicated upon practicalities, i.e., utilization of, e.g., yeast cells for the 
15 expression of a GPCR, while possible, introduces into the protocol a non-mammalian 
cell which may not (indeed, in the case of yeast, does not) include the receptor- 
coupling, genetic-mechanism and secretary pathways that have evolved for 
mammalian systems - thus, results obtained in non-mammalian cells, while of 
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potential use, are not as preferred as that obtained from mammalian cells. Of the 
mammalian cells, COS-7, 293 and 293T cells are particularly preferred, although the 
specific mammalian cell utilized can be predicated upon the particular needs of the 
artisan. 

5 a. Transient Transfection 

On day one, 6xl0 6 / 10 cm dish of 293 cells well were plated out. On day two, 
two reaction tubes were prepared (the proportions to follow for each tube are per plate): 
tube A was prepared by mixing 4\ig DNA (e.g., pCMV vector; pCMV vector with 
receptor cDNA, etc.) in 0.5 ml serum free DMEM (Gibco BRL); tube B was prepared by 

10 mixing 24jal lipofectamine (Gibco BRL) in 0.5ml serum free DMEM. Tubes A and B 
were admixed by inversions (several times), followed by incubation at room temperature 
for 30-45min. The admixture is referred to as the "transfection mixture". Plated 293 
cells were washed with 1XPBS, followed by addition of 5 ml serum free DMEM. 1 ml 
of the transfection mixture were added to the cells, followed by incubation for 4hrs at 

15 37°C/5% CO2. The transfection mixture was removed by aspiration, followed by the 
addition of 10ml of DMEM/10% Fetal Bovine Serum. Cells were incubated at 37°C/5% 
CO2. After 48hr incubation, cells were harvested and utilized for analysis, 
b. Stable Cell Lines: Gs Fusion Protein 

Approximately 12xl0 6 293 cells are plated on a 15cm tissue culture plate. 

20 Grown in DME High Glucose Medium containing ten percent fetal bovine serum and 
one percent sodium pyruvate, L-glutamine, and anti-biotics. Twenty-four hours 
following plating of 293 cells to -80% confluency, the cells are transfected using 12)ag 
of DNA. The 12 jag of DNA is combined with 60ul of lipofectamine and 2mL of DME 
High Glucose Medium without serum. The medium is aspirated from the plates and the 

25 cells are washed once with medium without serum. The DNA, lipofectamine, and 
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medium mixture is added to the plate along with lOmL of medium without serum. 
Following incubation at 37 degrees Celsius for four to five hours, the medium is 
aspirated and 25ml of medium containing serum is added. Twenty- four hours following 
transfection, the medium is aspirated again, and fresh medium with serum is added. 
5 Forty-eight hours following transfection, the medium is aspirated and medium with 
serum is added containing geneticin (G418 drug) at a final concentration of 500jag/mL. 
The transfected cells now undergo selection for positively transfected cells containing 
the G418 resistant gene. The medium is replaced every four to five days as selection 
occurs. During selection, cells are grown to create stable pools, or split for stable clonal 
10 selection. 
Example 4 

Assays For determination of Constitutive Activity 
of Non-Endogenous GPCRs 

15 A variety of approaches are available for assessment of constitutive activity of 

the non-endogenous human GPCRs. The following are illustrative; those of ordinary 

skill in the art are credited with the ability to determine those techniques that are 

preferentially beneficial for the needs of the artisan. 

1. Membrane Binding Assays: [ 35 S]GTPyS Assay 

20 

When a G protein-coupled receptor is in its active state, either as a result of 
ligand binding or constitutive activation, the receptor couples to a G protein and 
stimulates the release of GDP and subsequent binding of GTP to the G protein. The 
alpha subunit of the G protein-receptor complex acts as a GTPase and slowly hydrolyzes 
25 the GTP to GDP, at which point the receptor normally is deactivated. Constitutively 
activated receptors continue to exchange GDP for GTP. The non-hydrolyzable GTP 
analog, [ K S]GTPyS, can be utilized to demonstrate enhanced binding of [ S]GTPyS to 
membranes expressing constitutively activated receptors. The advantage of using 

43 
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[^SjGTPyS binding to measure constitutive activation is that: (a) it is genencally 
applicable to all G protein-coupled receptors; (b) it is proximal at the membrane surface 
making it less likely to pick-up molecules which affect the intracellular cascade. 

The assay utilizes the ability of G protein coupled receptors to stimulate 
[ 35 S]GTPyS binding to membranes expressing the relevant receptors. The assay can, 
therefore, be used in the direct identification method to screen candidate compounds to 
known, orphan and constitutively activated G protein-coupled receptors. The assay is 
generic and has application to drug discovery at all G protein-coupled receptors. 

The [ 3!> S]GTPyS assay was incubated in 20 mM HEPES and between 1 and 
about 20mM MgCh (this amount can be adjusted for optimization of results, although 
20mM is preferred) pH 7.4, binding buffer with between about 0.3 and about 1.2 nM 
[ 35 S]GTPyS (this amount can be adjusted for optimization of results, although 1.2 is 
preferred ) and 12.5 to 75 p,g membrane protein (e.g, 293 cells expressing the Gs Fusion 
Protein; this amount can be adjusted for optimization) and 10 jaM GDP (this amount can 
be changed for optimization) for 1 hour. Wheatgerm agglutinin beads (25 jil; 
Amersham) were then added and the mixture incubated for another 30 minutes at room 
temperature. The tubes were then centrifuged at 1500 x g for 5 minutes at room 
temperature and then counted in a scintillation counter. 
2. Adenylyl Cyclase 

A Flash Plate™ Adenylyl Cyclase kit (New England Nuclear; Cat. No, 
SMP004A) designed for cell-based assays can be modified for use with crude plasma 
membranes. The Flash Plate wells can contain a scintillant coating which also contains a 
specific antibody recognizing cAMP. The cAMP generated in the wells can be 
quantitated by a direct competition for binding of radioactive cAMP tracer to the cAMP 

44 
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antibody. The following serves as a brief protocol for the measurement of changes in 
cAMP levels in whole cells that express the receptors. 

Transfected cells were harvested approximately twenty 7 four hours after transient 
transfection. Media is carefully aspirated off and discarded. 10ml of PBS is gently 
added to each dish of cells followed by careful aspiration. 1ml of Sigma cell 
dissociation buffer and 3ml of PBS are added to each plate. Cells were pipeted off the 
plate and the cell suspension was collected into a 50ml conical centrifuge tube. Cells 
were then centrifuged at room temperature at 1,100 rpm for 5 min. The cell pellet was 
carefully re-suspended into an appropriate volume of PBS (about 3ml/plate). The cells 
were then counted using a hemocytometer and additional PBS was added to give the 
appropriate number of cells (with a final volume of about 50 |il/well). 

cAMP standards and Detection Buffer (comprising 1 jiCi of tracer [ 125 I cAMP 
(50 jil] to 1 1 ml Detection Buffer) was prepared and maintained in accordance with the 
manufacturer's instructions. Assay Buffer was prepared fresh for screening and 
contained 50jal of Stimulation Buffer, 3ul of test compound (12uM final assay 
concentration) and 50)il cells, Assay Buffer was stored on ice until utilized. The assay 
was initiated by addition of 50fil of cAMP standards to appropriate wells followed by 
addition of 50ul of PBS A to wells H-l 1 and HI 2. 50^1 of Stimulation Buffer was added 
to all wells. DMSO (or selected candidate compounds) was added to appropriate wells 
using a pin tool capable of dispensing 3p.I of compound solution, with a final assay 
concentration of 12pM test compound and 100}j.l total assay volume. The cells were 
then added to the wells and incubated for 60 min at room temperature. 100|al of 
Detection Mix containing tracer cAMP was then added to the wells. Plates were then 
incubated additional 2 hours followed by counting in a Wallac MicroBeta scintillation 
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counter. Values of cAMP/well were then extrapolated from a standard cAMP curve 
which was contained within each assay plate. 

3. Cell-Based cAMP for Gi Coupled Target GPCRs 

5 TSHR is a Gs coupled GPCR that causes the accumulation of cAMP upon 

activation. TSHR will be constitutively activated by mutating amino acid residue 623 
(i.e., changing an alanine residue to an isoleucine residue). A Gi coupled receptor is 
expected to inhibit adenylyl cyclase, and, therefore, decrease the level of cAMP 
production, which can make assessment of cAMP levels challenging. An effective 
10 technique for measuring the decrease in production of cAMP as an indication of 
constitutive activation of a Gi coupled receptor can be accomplished by co-transfecting, 
most preferably, non-endogenous, constitutively activated TSHR (TSHR-A623I) (or an 
endogenous, constitutively active Gs coupled receptor) as a "signal enhancer" with a Gi 
linked target GPCR to establish a baseline level of cAMP. Upon creating a non- 
15 endogenous version of the Gi coupled receptor, this non-endogenous version of the 
target GPCR is then co-transfected with the signal enhancer, and it is this material that 
can be used for screening. We will utilize such approach to effectively generate a signal 
when a cAMP assay is used; this approach is preferably used in the direct identification 
of candidate compounds against Gi coupled receptors. It is noted that for a Gi coupled 
20 GPCR, when this approach is used, an inverse agonist of the target GPCR will increase 
the cAMP signal and an agonist will decrease the cAMP signal. 

On day one, 2X10 4 2 93 and 293 cells/well will be plated out. On day two, two 
reaction tubes will be prepared (the proportions to follow for each tube are per plate): 
tube A will be prepared by mixing 2|ig DNA of each receptor transfected into the 
25 mammalian cells, for a total of 4\xg DNA (e.g., pCMV vector; pCMV vector with 
mutated THSR (TSHR-A623I); TSHR-A623I and GPCR, etc.) in 1.2ml serum free 
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DMEM (Irvine Scientific, Irvine, CA); tube B will be prepared by mixing 120pl 
lipofectamine (Gibco BRL) in 1.2ml serum free DMEM. Tubes A and B will then be 
admixed by inversions (several times), followed by incubation at room temperature for 
30-45min. The admixture is referred to as the "transfection mixture". Plated 293 cells 
5 will be washed with 1 XPBS, followed by addition of 10ml serum free DMEM. 2.4ml of 
the transfection mixture will then be added to the cells, followed by incubation for 4hrs 
at 37°C/5% CO:- The transfection mixture will then be removed by aspiration, followed 
by the addition of 25ml of DMEM/10% Fetal Bovine Serum. Cells will then be 
incubated at 37°C/5% CO?. After 24hr incubation, cells will then be harvested and 

10 utilized for analysis. 

A Flash Plate™ Adenylyl Cyclase kit (New England Nuclear; Cat. No. 
SMP004A) is designed for cell-based assays, however, can be modified for use with 
crude plasma membranes depending on the need of the skilled artisan. The Flash Plate 
wells will contain a scmtillant coating which also contains a specific antibody 

15 recognizing cAMP. The cAMP generated in the wells can be quantitated by a direct 
competition for binding of radioactive cAMP tracer to the cAMP antibody. The 
following serves as a brief protocol for the measurement of changes in cAMP levels in 
whole cells that express the receptors. 

Transfected cells will be harvested approximately twenty four hours after 

20 transient transfection. Media will be carefully aspirated off and discarded. 10ml of PBS 
will be gently added to each dish of cells followed by careful aspiration. 1ml of Sigma 
cell dissociation buffer and 3ml of PBS will be added to each plate. Cells will be pipeted 
off the plate and the cell suspension will be collected into a 50ml conical centrifuge tube. 
Cells will then be centrifuged at room temperature at 1,100 rpm for 5 min. The cell 

25 pellet will be carefully re-suspended into an appropriate volume of PBS (about 
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3ml/plate). The cells will then he counted using a hemocytometer and additional PBS is 
added to give the appropriate number of cells (with a final volume of about 
50^1/well). 

cAMP standards and Detection Buffer (comprising 1 jaCi of tracer [ 125 I cAMP 
5 (50 fx\] to 1 1 ml Detection Buffer) will be prepared and maintained in accordance with 
the manufacturer's instructions. Assay Buffer should be prepared fresh for screening 
and contained 50jil of Stimulation Buffer, 3ul of test compound (12uM final assay 
concentration) and SOjil cells, Assay Buffer can be stored on ice until utilized. The assay 
can be initiated by addition of 50jnl of cAMP standards to appropriate wells followed by 

10 addition of 50^1 of PBSA to wells H-l 1 and H12. 50ul of Stimulation Buffer will be 
added to all wells. Selected compounds {e.g., TSH) will be added to appropriate wells 
using a pin tool capable of dispensing 3jj1 of compound solution, with a final assay 
concentration of 12|aM test compound and lOOfil total assay volume. The cells will then 
be added to the wells and incubated for 60 min at room temperature. 100jil of Detection 

15 Mix containing tracer cAMP will then be added to the wells. Plates were then incubated 
additional 2 hours followed by counting in a Wallac MicroBeta scintillation counter. 
Values of cAMP/well will then be extrapolated from a standard cAMP curve which is 
contained within each assay plate. 



293 and 293T cells are plated-out on 96 well plates at a density of 2 x 10 4 cells 
per well and were transfected using Lipofectamine Reagent (BRL) the following day 
according to manufacturer instructions. A DNA/lipid mixture is prepared for each 6- 
well transfection as follows: 260ng of plasmid DNA in 100|il of DMEM were gently 
25 mixed with 2pl of lipid in lOOjil of DMEM (the 260ng of plasmid DNA consisted of 
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Reporter-Based Assays 



20 



Cre-Luc Reporter Assay (Gs-associated receptors) 
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200ng of a SxCRE-Luc reporter plasmid, 50ng of pCMV comprising endogenous 
receptor or non-endogenous receptor or pCMV alone, and lOng of a GPRS expression 
plasmid (GPRS in pcDNA3 (Invitrogen)). The 8XCRE-Luc reporter plasmid was 
prepared as follows: vector SRIF-fJ-gal was obtained by cloning the rat somatostatin 
5 promoter (-71/4-51) at BglV-Hindlll site in the ppgal-Basic Vector (Clontech). Eight 
(8) copies of cAMP response element were obtained by PGR from an adenovirus 
template AdpCF 1 26GCRE8 (see, 7 Human Gene Therapy 1883 (1996)) and cloned 
into the SRTF-R-gal vector at the Knn-RglV site, resulting in the 8xCRF-f3-gal 
reporter vector. The 8xCRJE-Luc reporter plasmid was generated by replacing the 

10 beta-galactosidase gene in the 8xCRE~P-gal reporter vector with the luciferase gene 
obtained from the pGL3-basic vector (Promega) at the Hindlll-BamHI site. 
Following 30 min. incubation at room temperature, the DNA/lipid mixture was 
diluted with 400 (il of DMEM and 100|il of the diluted mixture was added to each 
well. 100 \i\ of DMEM with 10% FCS were added to each well after a 4hr incubation 

15 in a cell culture incubator. The following day the transfected cells were changed with 
200 nl/well of DMEM with 10% FCS. Eight (8) hours later, the wells were changed 
to 100 fil /well of DMEM without phenol red, after one wash with PBS. Luciferase 
activity were measured the next day using the LucLite™ reporter gene assay kit 
(Packard) following manufacturer instructions and read on a 1450 MicroBeta™ 

20 scintillation and luminescence counter (Wallac). 

b. API reporter assay (Gq-associated receptors) 
A method to detect Gq stimulation depends on the known property of Gq- 
dependent phospholipase C to cause the activation of genes containing API elements 

25 in their promoter. A Pathdetect™ AP-1 cis-Reporting System (Stratagene, Catalogue 
# 219073) can be utilized following the protocol set forth above with respect to the 
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CREB reporter assay, except that the components of the calcium phosphate precipitate 
were 410 ng pAPl-Luc, 80 ng pCMV-receptor expression plasmid, and 20 ng CMV- 
SEAP. 

c. Srf-Luc Reporter Assay (Gq- associated receptors) 

5 One method to detect Gq stimulation depends on the known property of Gq- 

dependent phospholipase C to cause the activation of genes containing serum 
response factors in their promoter. A Pathdetect™ SRF-Luc-Reporting System 
(Stratagene) can be utilized to assay for Gq coupled activity in, e.g., COS7 cells. 
Cells are transfected with the plasmid components of the system and the indicated 

10 expression plasmid encoding endogenous or non-endogenous GPCR using a 
Mammalian Transfection™ Kit (Stratagene, Catalogue #200285) according to the 
manufacturer's instructions. Briefly, 410 ng SRF-Luc, 80 ng pCMV -receptor 
expression plasmid and 20 ng CMV-SEAP (secreted alkaline phosphatase expression 
plasmid; alkaline phosphatase activity is measured in the media of transfected cells to 

15 control for variations in transfection efficiency between samples) are combined in a 
calcium phosphate precipitate as per the manufacturer's instructions. Half of the 
precipitate is equally distributed over 3 wells in a 96-well plate, kept on the cells in a 
serum free media for 24 hours. The last 5 hours the cells are incubated with l|iM 
Angiotensin, where indicated. Cells are then lysed and assayed for luciferase activity 

20 using a Luclite™ Kit (Packard, Cat. # 601691 1) and "Trilux 1450 Microbeta" liquid 
scintillation and luminescence counter (Wallac) as per the manufacturer's 
instructions. The data can be analyzed using GraphPad Prism™ 2.0a (GraphPad 
Software Inc.). 

25 
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d. Intracellular IP3 Accumulation Assay (Gq-associated 
receptors) 

On clay 1, cells comprising the receptors (endogenous and/or non-endogenous) 
5 can be plated onto 24 well plates, usually 1x10' cells/well (although his umber can be 
optimized. On day 2 cells can be transfected by firstly mixing 0.25 jig DNA in 50 ul 
serum free DMEM/well and 2 ul lipofectamine in 50 ul serumfree DMEMAvell. The 
solutions are gently mixed and incubated for 15-30 min at room temperature. Cells are 
washed with 0.5 ml PBS and 400 ul of serum free media is mixed with the transfection 

10 media and added to the cells. The cells are then incubated for 3-4 hrs at 37°C/5%C0 2 
and then the transfection media is removed and replaced with 1 ml/well of regular growth 
media. On day 3 the cells are labeled with 3 H-myo-inositol. Briefly, the media is 
removed and the cells are washed with 0.5 ml PBS. Then 0.5 ml inositol-free/serum free 
media (GIBCO BRL) is added/well with 0.25 uCi of 3 H-myo-inositol/ well and the cells 

1 5 are incubated for 16-18 hrs o/n at 37°C/5%C0 2 . On Day 4 the cells are washed with 0.5 
ml PBS and 0.45 ml of assay medium is added containing inositol-free/serum free media 
10 uM pargyline 10 mM lithium chloride or 0.4 ml of assay medium and 50ul of lOx 
ketanserin (ket ) to final concentration of 10u,M. The cells are then incubated for 30 mm 
at 37°C. The cells are then washed with 0.5 ml PBSand 200ul of fresh/icecold stop 

20 solution (1M KOH; 18 mM Na-borate; 3.8 mM EDTA) is added/well. The solution is 
kept on ice for 5-10 min or until cells were lysed and then neutralized by 200 ul of 
fresh/ice cold neutralization sol. (7.5 % HCL). The lysate is then transferred into 1.5 ml 
eppendorf tubes and 1 ml of chloroform/methanol (1:2) is added/tube. The solution is 
vortexed for 15 sec and the upper phase is applied to a Biorad AG1-X8™ anion 

25 exchange resin (100-200 mesh). Firstly, the resin is washed with water at 1:1.25 W/V 
and 0.9 ml of upper phase is loaded onto the column. The column is washed with 10 mis 
of 5 mM myo-mositol and 10 ml of 5 mM Na-borate/60mM Na- formate. The inositol 
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tris phosphates are eluted into scintillation vials containing 10 ml of scintillation cocktail 
with 2 ml of 0.1 M formic acid/ 1 M ammonium formate. The columns are regenerated 
by washing with 10 ml of 0.1 M formic acid/3 M ammonium formate and rinsed twice 
with dd H2O and stored at 4°C in water. 
5 Exemplary results are presented below in Table G: 



TABLE G 



Receptor 


Mutation 


Assay 
Utilized 
Figure No.) 


Signal 
Generated: 

CJV1 V 


Signal 
Generated: 
Endogenous 
Version 
(Relative Light 
Units) 


Signal 
Generated: 

Non- 
Endogenous 
Version 
(Relative 
Lignt Units) 


Difference 

(<=0 

Between 
. CMVv. 

Wild-type 
. Wild-type 

v. Mutant 


hRUP12 


N/A 


1P3 

(Figure 1 ) 


317.03 
cpm/mg protein 


3463.29 
cpm/mg protein 




1 . 11 r old 

<= 


hRUPO 


N/A 


cAMP 
( Figure 2 > 


8.06 
pmol/cAMP/mg 
protein 


19.10 
pmol/cAMP/mg 
protein 


-- 


1. 2.4 Fold <^ 


A268K 


8XCRE- 

LUC 
(Figure 3) 


3665.43 
LCPS 


83280.17 
LPCS 


61713.6 
LCPS 


1. 23 Fold 

2. 26 %< 


KRUP14 


L246K 


8XCRE- 

LUC 
(Figure 5 ) 


86.07 
LCPS 


1962.87 
LCPS 


789.73 
LCPS 


1. 23 Fold c= 

2. 60% ( 


hRUP15 


A398K 


8XCRE- 

LUC 
(Figure 6) 


86.07 
LCPS 


18286.77 
LCPS 


17034.83 
LCPS 


1. 212 Fold 

<^ 

2. 1%< 


A398K 


cAMP 
(Figure 7) 


15.00 
pmol/cAMP/mg 
protein 


164.4 

pmol/cAMP/mg 
protein 


117.5 
pmol/cAMP/ 
mg protein 


1. 11 Fold 

2. 29% < 


hRUP17 


N/A 


IP3 
(Figure 9 ) 


317.03 
cpm/mg protein 


741.07 
cpm/mg protein 




1. 2.3 Fold 


hRUP21 


N/A 


IP 3 

(Figure 10) 


730.5 
cpm/mg protein 


1421.9 
cpm/mg protein 




1. 2 Fold <= 


hRlTP23 


W275K 


8XCRE- 
LUC 
(Figure 1 1) 


311.73 
pmol/cAMP/mg 
protein 


13756.00 
pmol/c AMP/ mg 
protein 


9756.87 
pmol/cAMP/ 
mg protein 


1. 44 Fold <= 

2. 30% < 



N/A = not applied 



52 



WO ni/3o4-| P( 171 MHrji5o«) 

Exemplary results of GTPyS assay for detecting constitutive activation, as 
disclosed in Example 4(1) above, was accomplished utilizing Gs:Fusion Protein 
Constructs on human RUP13 and RUP15. Table H below lists the signals generated 
from this assay and the difference in signals as indicated: 



TABLE H 



Receptor; 
Gs Fusion 
Protein 


Assay 
Utilized 


Signal 
Generated: 

CMV 
(cpm bound 

GTP) 


Signal 
Generated: 

Fusion 

Protein 
(cpm bound 
GTP) 


Signal 
Generated: 

CMV+ 
lOuMGDP 
(cpm bound 
GTP) 


Signal 
Generated: 
Fusion 
Protein + 
lOuMGDP 
(cpm bound 
GTP) 


Difference 
Between: 

1. CMV v. Fusion 
Protein 

2. CMV+GDF 

vs. 

Fusion+<;DP 

3. Fusion vs. 
Fusion+GDP 

(cpm bound GTP) 


hRlT13-Gs 


GTPyS 
(Figure 4) 


32494.0 


49351.30 


11 148.30 


28834.67 


1. 1.5 Fold c= 

2. 2.6 Fold w 

3. 42% ( 


hKt T15-Gs 


GTPyS 
(Figure 8) 


30131.67 


32493.67 


7697.00 


14157.33 


1.1.1 Fold <= 

2. 1.8 Fold <= 

3. 56% ( 



Example 5 

Fusion Protein Preparation 

a. GPCRrGs Fusion Constuct 

The design of the constitutively activated GPCR-G protein fusion construct was 
accomplished as follows: both the 5' and 3' ends of the rat G protein Gsa (long form; 
Itoh, H. et aL 83 PNAS 3776 (1986)) were engineered to include a Hindlll (5*- 
AAGCTT-3") sequence thereon. Following confirmation of the correct sequence 
(including the flanking Hindlll sequences), the entire sequence was shuttled into 
pcDNA3.1(-) (Invitrogen, cat. no. V795-20) by subcloning using the HindTTI restriction 
site of that vector. The correct orientation for the Gsa sequence was determined after 
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subcloning into pcDNA3.1(-). The modified pcDNA3.1(-) containing the rat Gsa gene 
at Hindlll sequence was then verified; this vector was now available as a "universal" 
Gsa protein vector. The pcDNA3.1(-) vector contains a variety of well-known 
restriction sites upstream of the Hindlll site, thus beneficially providing the ability to 
5 insert, upstream of the Gs protein, the coding sequence of an endogenous, constitutively 
active GPCR. This same approach can be utilized to create other "universal" G protein 
vectors, and, of course, other commercially available or proprietary vectors known to the 
artisan can be utilized - the important criteria is that the sequence for the GPCR be 
upstream and in-frame with that of the G protein. 
10 RUP13 couples via Gs. For the following exemplary GPCR Fusion Proteins, 

fusion to Gsa was accomplished. 

A RUP13-Gsa Fusion Protein construct was made as follows: primers were 
designed as follows: 

5 ' -gatcfTCTAG AAT] GG AGTCCTC ACCC ATCCCCC AG -3' (SEQ.ID.NO.:97; sense) 
1 5 5 '-gate [G ATATCJCGTGACTCC AGCCGGGGTG AGGCGGC-3 ' (SEQiD.NO.:98; antisense). 

Nucleotides in lower caps are included as spacers in the restriction sites 
(designated in brackets) between the G protein and RUP13. The sense and anti-sense 
primers included the restriction sites for Xbal and EcoRV, respectively, such that spacers 
(attributed to the restriction sites) exists between the G protein and RUP15. 
20 PCR was then utilized to secure the respective receptor sequences for fusion 

within the Gsa universal vector disclosed above, using the following protocol for each: 
lOOng cDNA for RUP15 was added to separate tubes containing 2jil of each primer 
(sense and anti-sense), 3^L of lOmM dNTPs, \0yiL of lOXTaqPlus™ Precision buffer, 
l|iL of TaqPlus™ Precision polymerase (Stratagene: #600211), and 80^L of water. 
25 Reaction temperatures and cycle times for RUP15 were as follows with cycle steps 2 
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through 4 were repeated 35 times: 94°C for 1 min; 94°C for 30 seconds; 62°C for 20 
sec; 72°C 1 min 40see; and 72° C 5 mm PCR product for was run on a 1% agarose 
gel and then purified (data not shown). The purified product was digested with XbaJ and 
EcoRV and the desired inserts purified and ligated into the Gs universal vector at the 
5 respective restriction site. The positive clones was isolated following transformation and 
determined by restriction enzyme digest; expression using 293 cells was accomplished 
following the protocol set forth infra. Each positive clone for RUP15-Gs Fusion Protein 
was sequenced to verify correctness. (See, SEQ.ID.NO.:99 for nucleic acid sequence 
and SEQ.ED.NO. : 1 00 for amino acid sequence ). 
10 RUP15 couples via Gs. For the following exemplary GPCR Fusion Proteins, 

fusion to Gsot was accomplished. 

A RUP15-Gsa Fusion Protein construct was made as follows: primers were 
designed as follows: 

5 '-TCTAG AATGACGTCCACCTGCACCAACAGC-3 * (SEQ.ID.NO.:101; sense) 
15 5 '-gatatcGC AGG AAAAGTAGCAG AATCGTAGG AAG-3 ' (SEQ.ID.NO : 102; antisense). 

Nucleotides in lower caps are included as spacers in the restriction sites between 
the G protein and RUP15. The sense and anti-sense primers included the restriction sites 
for EcoRV and Xbal, respectively, such that spacers (attributed to the restriction sites) 
exists between the G protein and RUP1 5. 
20 PCR was then utilized to secure the respective receptor sequences for fusion 

within the Gsa universal vector disclosed above, using the following protocol for each: 
lOOng cDNA for RUP15 was added to separate tubes containing 2ul of each primer 
(sense and anti-sense), 3uL of lOmM dNTPs, lOuL of lOXTaqPlus™ Precision buffer, 
luL of TaqPlus™ Precision polymerase (Stratagene: #600211), and 80uL of water. 
25 Reaction temperatures and cycle times for RUP15 were as follows with cycle steps 2 
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through 4 were repeated 35 times: 94°C for 1 min; 94°C for 30 seconds; 62°C for 20 
sec; 72°C 1 min 40sec; and 72° C 5 min . PCR product for was run on a 1% agarose 
gel and then purified (data not shown). The purified product was digested ). The 
purified product was digested with EcoRV and Xbal and the desired inserts purified and 
5 ligated into the Gs universal vector at the respective restriction site. The positive clones 
was isolated following transformation and determined by restriction enzyme digest; 
expression using 293 cells was accomplished following the protocol set forth infra. 
Each positive clone for RUP15-Gs Fusion Protein was sequenced to verify correctness. 
(See, SEQ.ID.NO.: 103 for nucleic acid sequence and SEQ.ID.NO.:104 for amino acid 
1 0 sequence ). 

b. Gq(6 amino acid deletion)/Gi Fusion Construct 

The design of a Gq (del)/Gi fusion construct can be accomplished as follows: 
the N-terminal six (6) amino acids (amino acids 2 through 7, having the sequence of 
TLESIM (SEQ.ID.NO.: 129) Gaq-subunit will be deleted and the C-terminal five (5) 
15 amino acids, having the sequence EYNLV (SEQ.ID.NO.: 130) will be replace with the 
corresponding amino acids of the Gcri Protein, having the sequence DCGLF 
(SEQ.ID.NO.: 131). This fusion construct will be obtained by PCR using the following 
primers: 

5 , -gatcaagcttcCATGGCGTGCTGCCTGAGCGAGGAG-3 , (SEQ.ID.NO.: 132) and 

20 

5 , -gatcggatccTTAGAACAGGCCGCAGTCC^TCAGGTTCAGCTGCAGGATGGTG-3 , 
(SEQ.ID.NO.: 133) 

and Plasmid 63313 which contains the mouse Gotq-wild type version with a 
25 hemagglutinin tag as template. Nucleotides in lower caps are included as spacers. 

TaqPlus Precision DNA polymerase (Stratagene) will be utilized for the 
amplification by the following cycles, with steps 2 through 4 repeated 35 times: 95°C 

56 
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for 2 min; 95°C for 20 sec; 56°C for 20 sec; 72°C for 2 min; and 72°C for 7 min. The 
PCR product will be cloned into a pCRII-TOPO vector (Invitrogen) and sequenced 
using the ABI Big Dye Terminator kit (P.E. Biosystem). Inserts from a TOPO clone 
containing the sequence of the fusion construct will be shuttled into the expression 
5 vector pcDNA3. 1 (+) at the Hindlll/BamHI site by a 2 step cloning process. 
Example 6 

Tissue Distribution of the disclosed human GPCRs: RT-PCR 

RT-PCR was applied to confirm the expression and to determine the tissue 
10 distribution of several novel human GPCRs. Oligonucleotides utilized were GPCR- 
specific and the human multiple tissue cDNA panels (MTC, Clontech) as templates. 
Taq DNA polymerase (Stratagene) were utilized for the amplification in a 40uJ 
reaction according to the manufacturer's instructions. 20fil of the reaction will be 
loaded on a 1 .5% agarose gel to analyze the RT-PCR products. Table J below lists the 
1 5 receptors, the cycle conditions and the primers utizilized. 



TABLE J 



Receptor 
Identifier 


Cycle 
Conditions 
Min ( fc ). Sec (") 
Cycles 2-4 
repeated 30 
times 


5" Primer 
(SEQ.ID.NO.) 


3' Primer 
(SEQ.ID.NO.) 


DNA Fragment 


Tissue 
Expression 


hRUPlO 


94° for 30" 
94° for 10" 
62°C for 20" 
72° for 1 ' 
72° for 1" 
*cycles 2-4 
repeated 35 times 


CATGTATGC 
CAGCGTCCT 
GCTCC (105) 


GCTATGCCTG 

AAGCCAGTC 

TTGTG(106) 


730bp 


Kidney, 
leukocyte, liver, 
placenta and 
spleen 


hRUPll 


94° for 2 ' 
94° for 15" 
67°C for 15" 
72° for 45" 
72° for 5' 


GCACCTGCT 
CCTGAGCAC 

CTTCTCC 
(107) 


CACAGCGCT 
GCAGCCCTG 
CAGCTGGC 
(108) 


630bp 


Liver, kidney, 
pancreas, colon, 
small intestinal, 
spleen and 
prostate 
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hRUP12 


94° for 2' 


ccagtgatg 


CAGACACIT 


490bp 


Brain, colon, 




94° for 15" 


ACTCTGTCC 


GGCAGGGAC 




heart, kidney, 




66°C for 15" 


AGCCTG(109) 


GAGGTG (110) 




leukocyte, 




72° for 45" 








pancreas, 




72° for 5* 








prostate, small 

intestinal, 
spleen, testis, 
and tliymus 
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1 HRIT13 


04° tor 1 ' 
*>4 n for 15" 
oS°C for 20" 
"2° foi r 45" 

72° for 5 1 


CTTGTGGTCT 
ACTGCAGCA 
TGTTCCG 
(Ml) 


CATATCCCTC 
CGAGTGTCC 
AGCGGC (112) 


700bp 


Placenta and 
lung 


hRl T14 


W for 1 1 
04° for 15" 
<>8°C for 20" 
72° for 1M5" 
72° for 5' 


AIGGATCCr 
TATCATGGC 

ticctc (i 13) 


CAAGAACAG 
GTCTCATCTA 
AGAGCTCC 
(114) 


7()0bp 


Not yet 
determined 


hRUP16 


04° for 30" 
94° for 5" 
oQ°c for 15" 
72° for 30" 
72° for 5' 


CTCTGATGC 
CATCTGCTG 
GATTCCTG 
(115) 


GTAGTCCACT 
GAAAGTCCA 
GTGATCC 
(116) 


370bp 


Fetal brain, fetal 
kidney and fetal 
skeletal muscle 


hRUP18 


04° for T 
94° for 15" 
60°C for 20" 
72° for T 
72° for 5 ' 


TGGTGGCGA 
TGGCCAACA 
GCGCTC(1 17) 


GTTGCGCCTP 
AGCGACAGA 
TGACC(1 18) 


330bp 


Pancreas 


hRUP21 


94° for 1 ' 
94° for 15" 
56 V C for 20" 
72° for 40" 
*cycles 2-3 
repeated 30 times 


TCAACCTGT 
ATAGCAGCA 
TCCTC(119) 


AAGGAGTAG 
CAGAATGGT 
TAGCC(120) 




Kidney, lung 
and testis 


hRUP22 


94° for 30" 
94° for 15" 
6°°C for 20" 
^2° for 40" 
"cycles 2-3 
repeated 30 times 


GACACCTGT 
CAGCGGTCG 
TGTGTG (121) 


CTGATGGAA 
GTAGAGGCT 
GTCCATCTC 
(122) 




Testis, thymus ! 
and spleen 


HRUP23 


94° for 2' 
94° for 15" 
60°C for 20" 
72° for 1 1 
72° for 5' 


GCGCTGAGC 
GCAGACCAG 
TGGCTG(123) 


CACGGTGAC 
GAAGGGCAC 
GAGCTC(124) 


520bp 


Placenta 


hRUP26 


94° for 2' 
04° for 15" 
65 °C for 20" 
72° for r 
72° for 5' 


AGCCATCCC 
TGCCAGGAA 
GCATGG(125) 


CCAGGTAGG 
TGTGCAGCA 
CAATGGC 
(126) 


470bp 


Pancreas 


hRUP27 


04° for 30" 
04° for 10" 
55°C for 20" 
72° for 1 1 
72° for 3* 
*cycles 2-4 
repeated 35 times 


CTGTTCAAC 
AGGGCTGGT 
TGGCAAC 
(127) 


ATCATGTCTA 
GACTCATGGT 
GATCC(128) 


890bp 


Brain 
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Example 7 

Protocol: Direct Identification of Inverse Agonists and Agonists 
A. [ 35 S]GTPyS Assay 

Although we have utilized endogenous, constitutively active GPCRs for the 
direct identification of candidate compounds as, e.g., inverse agonists, for reasons that 
are not altogether understood, intra-assay variation can become exacerbated. 
Preferably, then, a GPCR Fusion Protein, as disclosed above, is also utilized with a non- 
endogenous, constitutively activated GPCR. We have determined that when such a 
protein is used, intra-assay variation appears to be substantially stabilized, whereby an 
effective signal-to-noise ratio is obtained. This has the beneficial result of allowing for a 
more robust identification of candidate compounds. Thus, it is preferred that for direct 
identification, a GPCR Fusion Protein be used and that when utilized, the following 
assay protocols be utilized. 

1. Membrane Preparation 

Membranes comprising the constitutively active orphan GPCR Fusion Protein of 
interest and for use in the direct identification of candidate compounds as inverse 
agonists, agonists or partial agonists are preferably prepared as follows: 

a. Materials 

"Membrane Scrape Buffer" is comprised of 20mM HEPES and lOmM EDTA, 
pH 7.4; "Membrane Wash Buffer" is comprised of 20 mM HEPES and 0.1 mM 
EDTA, pH 7.4; "Binding Buffer" is comprised of 20mM HEPES, 100 mM NaCl, and 
lOmM MgCl 2 ,pH7.4 

b. Procedure 

All materials will be kept on ice throughout the procedure. Firstly, the media 
will be aspirated from a confluent monolayer of cells, followed by rinse with 10ml cold 

60 
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PBS, followed by aspiration. Thereafter, 5ml of Membrane Scrape Buffer will be added 
to scrape cells; this will be followed by transfer of cellular extract into 5 Oml centrifuge 
tubes (centrifuged at 20,000 rpm for 17 minutes at 4°C). Thereafter, the supernatant will 
be aspirated and the pellet will be resuspended in 30ml Membrane Wash Buffer 
5 followed by centrifuge at 20,000 rpm for 17 minutes at 4°C. The supernatant will then 
be aspirated and the pellet resuspended in Binding Buffer. This will then be 
homogenized using a Brmkman polytron™ homogenizer (15-20 second bursts until the 
all material is in suspension). This is referred to herein as "Membrane Protein". 
2. Bradford Protein Assay 

10 Following the homogenization, protein concentration of the membranes will 

be determined using the Bradford Protein Assay (protein can be diluted to about 
1.5mg/ml, aliquoted and frozen (-80°C) for later use; when frozen, protocol for use 
will be as follows: on the day of the assay, frozen Membrane Protein is thawed at 
room temperature, followed by vortex and then homogenized with a polytron at about 

15 12 x 1,000 rpm for about 5-10 seconds; it was noted that for multiple preparations, the 
homogenizor should be thoroughly cleaned between homoginezation of different 
preparations). 

a. Materials 

Binding Buffer (as per above); Bradford Dye Reagent; Bradford Protein 
20 Standard will be utilized, following manufacturer instructions (Biorad, cat. no. 500- 
0006). 

b. Procedure 

Duplicate tubes will be prepared, one including the membrane, and one as a 
control "blank". Each contained 800ul Binding Buffer. Thereafter, 10|il of Bradford 
25 Protein Standard (lmg/ml) will be added to each tube, and lOjil of membrane Protein 
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will then be added to just one tube (not the blank). Thereafter, 200ul of Bradford Dye 
Reagent will be added to each tube, followed by vortex of each. After five (5) 
minutes, the tubes will be re-vortexed and the material therein will be transferred to 
cuvettes. The cuvettes will then be read using a CECIL 3041 spectrophotometer, at 
5 wavelength 595. 

3. Direct Identification Assay 

a. Materials 

GDP Buffer consisted of 37.5 ml Binding Buffer and 2mg GDP (Sigma, cat, no, 
G-7127), followed by a series of dilutions in Binding Buffer to obtain 0.2 jaM GDP 
10 (final concentration of GDP in each well was 0.1 |iM GDP); each well comprising a 
candidate compound, has a final volume of 200ul consisting of 100|j.l GDP Buffer (final 
concentration, 0.1 |iM GDP), 50ul Membrane Protein in Binding Buffer, and 50jal 
[ 35 S]GTPyS (0.6 nM) in Binding Buffer (2.5 \x] [ 35 S]GTPyS per 10ml Binding Buffer). 

b. Procedure 

15 Candidate compounds will be preferably screened using a 96-well plate format 

(these can be frozen at -80°C). Membrane Protein (or membranes with expression 
vector excluding the GPCR Fusion Protein, as control), will be homogenized briefly 
until in suspension. Protein concentration will then be determined using the Bradford 
Protein Assay set forth above. Membrane Protein (and control) will then be diluted to 

20 0.25mg/ml in Binding Buffer (final assay concentration, 1 2.5 ji g/well). Thereafter, 100 
(il GDP Buffer was added to each well of a Wallac Scintistrip™ (Wallac). A 5ul pin- 
tool will then be used to transfer 5 |il of a candidate compound into such well (i.e., 5(il in 
total assay volume of 200 jal is a 1:40 ratio such that the final screening concentration of 
the candidate compound is 10|iM). Again, to avoid contamination, after each transfer 

25 step the pin tool should be rinsed in three reservoirs comprising water (IX), ethanol (IX) 
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and water (2X) - excess liquid should be shaken from the tool after each nnse and dried 
w ith paper and kimwipes. Thereafter, 50 pi of Membrane Protein will be added to each 
well (a control well comprising membranes without the GPCR Fusion Protein was also 
utilized), and pre-incubated for 5-10 minutes at room temperature. Thereafter, 50pl of 
5 ["S]GTPyS (06 nM) in Binding Buffer will be added to each well, followed by 
incubation on a shaker for 60 minutes at room temperature (again, in this example, plates 
were covered with foil). The assay will then be stopped by spinning of the plates at 4000 
RPM for 15 minutes at 22°C. The plates will then be aspirated with an 8 channel 
manifold and sealed with plate covers. The plates will then be read on a Wallace 1450 
1 0 using setting "Prot. #37" (as per manufacturer instructions). 
B. Cyclic AMP Assay 

Another assay approach to directly identified candidate compound was 
accomplished by utilizing a cyclase-based assay. In addition to direct identification, this 
1 5 assay approach can be utilized as an independent approach to provide confirmation of 
the results from the [ 3> S]GTPyS approach as set forth above. 

A modified Flash Plate™ Adenylyl Cyclase kit (New England Nuclear; Cat. No. 
SMP004A) was preferably utilized for direct identification of candidate compounds as 
inverse agonists and agonists to constitutively activated orphan GPCRs in accordance 
20 with the following protocol. 

Transfected cells were harvested approximately three days after transfection. 
Membranes were prepared by homogenization of suspended cells in buffer containing 
20mM HEPES, pH 7.4 and lOmM MgCk Homogenization was performed on ice using 
a Brinkman Polytron™ for approximately 10 seconds. The resulting homogenate is 
25 centnfuged at 49,000 X g for 15 minutes at 4°C. The resulting pellet was then 
resuspended in buffer containing 20mM HEPES, pH 7.4 and 0.1 mM EDTA, 
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homogenized for 10 seconds, followed by centrifugation at 49,000 X g for 1 5 minutes at 
4°C. The resulting pellet was then stored at -80°C until utilized. On the day of direct 
identification screening, the membrane pellet as slowly thawed at room temperature, 
resuspended in buffer containing 20mM HEPES, pH 7.4 and lOmM MgCL2, to yield a 
5 final protein concentration of 0.60mg/ml (the resuspended membranes are placed on ice 
until use). 

cAMP standards and Detection Buffer (comprising 2 (iCi of tracer [ 125 I cAMP 

(100 jaI] to 11 ml Detection Buffer) were prepared and maintained in accordance with 

the manufacturer's instructions. Assay Buffer was prepared fresh for screening and 
10 contained 20mM HEPES, pH 7.4, lOmM MgCl 2 , 20mM phospocreatine (Sigma), 0.1 

units/ml creatine phosphokinase (Sigma), 50 jiM GTP (Sigma), and 0.2 mM ATP 

(Sigma); Assay Buffer was then stored on ice until utilized. 

Candidate compounds identified as per above (if frozen, thawed at room 

temperature) were added, preferably, to 96-well plate wells (3(il/well; 12(iM final assay 
15 concentration), together with 40 \i\ Membrane Protein (30jig/well) and 50jil of Assay 

Buffer. This admixture was then incubated for 30 minutes at room temperature, with 

gentle shaking. 

Following the incubation, 100|il of Detection Buffer was added to each well, 
followed by incubation for 2-24 hours. Plates were then counted in a Wallac 
20 MicroBeta™ plate reader using "Prot. #3 1 " (as per manufacturer instructions). 

A representative screening assay plate (96 well format) result is presented in 
Figure 12. Each bar represents the results for a different compound in each well, plus 
RUP13-Gsa Fusion Protein construct, as prepared in Example 5(a) above. The 
representative results presented in Figure 12 also provide standard deviations based upon 
25 the mean results of each plate ("m") and the mean plus two arbitrary preference for 
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selection of inverse agonists as "leads" from the primary screen involves selection of 
candidate compounds that that reduce the per cent response by at least the mean plate 
response, minus two standard deviations. Conversely, an arbitrary preference for 
selection of an agonists as "leads" from the primary screen involves selection of 
5 candidate compounds that increase the per cent response by at least the mean plate 
response, plus the two standard deviations. Based upon these selection processes, the 
candidate compounds in the following wells were directly identified as putative inverse 
agonist (Compound A) and agonist (Compound B) to RUP13 in wells A2 and G9, 
respectively. See, Figure 12. It is noted for clarity: these compounds have been directly 

10 identified without any knowledge of the endogenous ligand for this GPCR. By focusing 
on assay techniques that are based upon receptor function, and not compound binding 
affinity, we are able to ascertain compounds that are able to reduce the functional 
activity of this receptor (Compound A) as well as increase the functional activity of the 
receptor (Compound B). Based upon the location of these receptor in lung tissue (see, 

15 for example, hRUP13 and hRUP21 in Example 6), pharmaceutical agents can be 
developed for potential therapeutic treatment of lung cancer. 

References cited throughout this patent document, including co-pending and 
related patent applications, unless otherwise indicated, are fully incorporated herein by 
reference. Modifications and extension of the disclosed inventions that are within the 

20 purview of the skilled artisan are encompassed within the above disclosure and the 
claims that follow. 

Although a variety of expression vectors are available to those in the art, for 
purposes of utilization for both the endogenous and non-endogenous human GPCRs, it is 
most preferred that the vector utilized be pCMV. This vector was deposited with the 
25 American Type Culture Collection (ATCC) on October 13, 1998 (10801 University 
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Blvd., Manassas, VA 201 10-2209 USA) under the provisions of the Budapest Treaty for 
the International Recognition of the Deposit of Microorganisms for the Purpose of Patent 
Procedure. The DNA was tested by the ATCC and determined to be viable. The ATCC 
has assigned the following deposit number to pCMV: ATCC #20335 1 . 
5 // 
// 
// 
// 
// 

10 // 

// 
// 
// 
// 

15 // 

// 
// 
// 
// 

20 // 
// 
// 
// 
// 

25 // 
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CLAIMS 

What is claimed is: 

1 . AG protein-coupled receptor encoded by an amino acid sequence of 
SEQ.ID.NO.:2. 

2. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 1 . 

3. A plasmid comprising a vector and the cDNA of SEQ.lD.NO.:l. 

4. A host cell comprising the plasmid of claim 3, 

5. AG protein-coupled receptor encoded by an amino acid sequence of 
SEQ.ID.NO.:4. 

6. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 5. 

7. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:3. 

8. A host cell comprising the plasmid of claim 7. 

9. AG protein-coupled receptor encoded by an amino acid sequence of 
SEQ.ID.NO.:6. 

10. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 9. 

1 1 . A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:5. 

12. A host cell comprising the plasmid of claim 1 1. 

1 3. A G protein-coupled receptor encoded by an amino acid sequence of 
SEQ.ID.NO.:8. 

14. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 13. 

67 



WO 01/3(i47| |>( l /l SOO/31509 

15. A plasmid comprising a vector and the cDNA of SEQ.ID.NO. ;7. 

16. A host cell comprising the plasmid of claim 15. 

17. A G protein-coupled receptor encoded by an amino acid sequence of 
SEQ.ID.NO.:10. 

5 18. A non-endogenous, constitutively activated version of the G protein-coupled 

receptor of claim 17. 

19. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:9. 

20. A host cell comprising the plasmid of claim 19. 

21 . A G protein-coupled receptor encoded by an amino acid sequence of 
10 SEQ.ED.NO.:12. 

22. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 21 comprising an amino acid sequence of SEQ.ID.NO. 84. 

23. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.: 1 1 . 

24. A host cell comprising the plasmid of claim 23. 

15 25. A G protein-coupled receptor encoded by an amino acid sequence of 

SEQ.ID.NO.:14. 

26. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 25 comprising an amino acid sequence of SEQ.ID.NO. 88. 

27. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.: 13. 
20 28. A host cell comprising the plasmid of claim 27. 

29. A G protein-coupled receptor encoded by an amino acid sequence of 
SEQ.ID.NO.: 16. 

30. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 29 comprising an amino acid sequence of SEQ.ID.NO. :92. 

25 31. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.: 15. 
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32. A host cell comprising the plasmid of claim 31. 

33. AG protein-coupled receptor encoded by an amino acid sequence of 
SEQ.ID.NO.:18. 

34. A non-endogenous, constitutively activated version of the G protein-coupled 
5 receptor of clajm 33. 

35. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:l 7. 

36. A host cell comprising the plasmid of claim 35. 

37. A G protein-coupled receptor encoded by an amino acid sequence of 
SEQ.ID.NO.:20. 

1 0 38. A non-endogenous, constitutively activated version of the G protein-coupled 

receptor of claim 37. 

39. A plasmid comprising a vector and the cDNA of SE. ID. NO. :19. 

40. A host cell comprising the plasmid of claim 39. 

41 . A G protein-coupled receptor encoded by an amino acid sequence of 
15 SEQ.ID.NO.:22. 

42. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 41 . 

43. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:21. 

44. A host cell comprising the plasmid of claim 43. 

20 45. A G protein-coupled receptor encoded by an amino acid sequence of 

SEQ.ID.NO.:24. 

46. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 45. 

47. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:23. 
25 48. A host cell comprising the plasmid of claim 47. 
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49. A G protein-coupled receptor encoded by an amino acid sequence of 
SEQ.ID.NO.:26. 

50. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 49. 

5 51. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:25. 

52. A host cell comprising the plasmid of claim 51. 

53. A G protein-coupled receptor encoded by an amino acid sequence of 
SEQ.TD.NO.:28. 

54. A non-endogenous, constitutively activated version of the G protein-coupled 
10 receptor of claim 53. 

55. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:27. 

56. A host cell comprising the plasmid of claim 55. 

57. A G protein-coupled receptor encoded by an amino acid sequence of 
SEQ.ID.NO.:30. 

15 58. A non-endogenous, constitutively activated version of the G protein-coupled 

receptor of claim 57. 

59. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:29. 

60. A host cell comprising the plasmid of claim 59. 

61 . A G protein-coupled receptor encoded by an amino acid sequence of 
20 SEQ.ID.NO.:32. 

62. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 61 comprising an amino acid sequence of SEQ.ID.NO.:96. 

63. A plasmid comprising a vector and the cDNA of SEQ.ED.NO.:95. 

64. A host cell comprising the plasmid of claim 63. 
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65. A G protein-coupled receptor encoded by an amino acid sequence of 
SEQ.ID.NO.:34. 

66. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 65. 

67. A plasmid composing a vector and the cDNA of SEQ.lD.NO.:33. 

68. A host cell comprising the plasmid of claim 67. 

69. A G protein-coupled receptor encoded by an amino acid sequence of 
SEQ.ID.NO.:36. 

70. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 69. 

71 . A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:35. 

72. A host cell comprising the plasmid of claim 71. 

73. A G protein-coupled receptor encoded by an amino acid sequence of 
SEQ.ID.NO.:38. 

74. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 73. 

75. A plasmid comprising a vector and the cDNA of SEQ.lD.NO.:37. 

76. A host cell comprising the plasmid of claim 75. 

77. A G protein-coupled receptor encoded by an amino acid sequence of 
SEQ.ID.NO.:40. 

78. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 77. 

79. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:39. 

80. A host cell comprising the plasmid of claim 79. 
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■itq^ragece agaatggaaa ca c cagt t 1 acacccaact ttaatccacc 

gcct:ctcr:c tctccrttaa c t L r a g 1 1 a t ggtgattatg acctccctat 

gaggacatga ccaagacccg gac:ttcttc geagecaaga tegtcattgg 

qcaggcatoa tgctggtetg eggcateggt aactttgtct ttatcgctgc 

tata-jgaaqt tgcgcaacct cac-::aat.ctq ci:attgcca acctggccat 

c tggt ggeca teatctgetg ccccttcgag atggactact acgtggcacq 

tqggaqca-g geraegtget ctg*:gcc:c: gtcaactacc tgcgcaccg: 

gtct z caeca atgecttget ggccattqc:; attgacagat atctcgccat 

rtgaaaccac ggatg^atta r.caaa:qgcc tccttcctga tcgccttggt 

tccattctca ttgccatccc at:ggci.La-: rttgcaacag aaaeggtect 

aagagecagg agaagatctt etqtggccaq atctggcctg tggatcagca 

aagtrctact tcctcitcat cttrggtgr.'.: gagttcgtgg gccctgtggt 

ctgtgctarg ccaggatctc ccgggagctc tggttcaagg cagtccctgg 

gagc^gattc geaagegget qcgrtgccgc agqaaqaegg tecrggtget. 

otca':ggcct atgtgctgtg ctgqgcaccc ttJtacggtt tcacca t cgt 

-tecicactg tgttcgtqaa qgaaaaqcac ta^ctcactg cct:ctacg; 

ci t. c 3 : c a r q a gcaa c aqcat gat :: a a c a c c c z qtqqttcg t g a c g g t c a a 

atgaagtaot tcaagaagat gatgetgetq ca-::ggcgtc cctcccagcg 

t ;*caq tgc*:g acrttqacct cagaaccaa: ggggtq-cca ccacagaaga 

atcaigctga agtga 



:aagaccat 
_j g a z q a g g a t 
cattgeactg 
:ct ca cccgc 
:tccgacttc 
gcag:tctc: 

tcc::tc:a: 
:gt::accc; 

tggatggtg 
ctt:attgtc 
get : tact a c 
:ac:Jtgacc 
gtt::agacq 
c a t g t g c a 1 1 
tegtgactte 
ggtcqagt gc 
2.iaz.^ncazc 
g gggagcaag 
qgtggact it 



180 

2 4 0 

300 

3 60 

4 20 
4 8 0 
0 4 !.■ 
60 0 

6 60 

7 2 0 

7 80 

8 4 0 
0 0 0 
0 6 0 

1023 
1080 
114 0 
1 150 



: i 0 2 
:il> 384 



Paae 



W O 01/3M71 PC I 71 SIMI/31509 



<2 12> PKT 

<213> Homo sapiens 

<400> 2 

Met Ala Ala Gin Asn Gly Asn Thr Ser Phe Thr Pro Asn Phe Asn Pro 

1 5 10 lb 

Pro Gin Asp His Ala Ser Ser Leu Ser Phe Asn Phe Ser Tyr Gly Asp 

20 25 30 

Tyr Asp Leu Pro Met Asp Glu Asp Glu Asp Met Thr Lys Thr Arg Thr 
3 5 4 0 4 5 

Phe Phe Ala Ala Lys lie Val lie Gly lie Ala Leu Ala Gly lie Met 
50 5 5 60 

Leu Val Cys Gly lie Gly Asn Phe Val Phe lie Ala Ala Leu Thr Arg 

65 70 75 80 

Tyr Lys Lys Leu Arg Asn Leu Thr Asn Leu Leu lie Ala Asn Leu A, la 
85 90 95 

lie Ser Asp Phe Leu Val Ala lie lie Cys Cys Pro Phe Glu Met Asp 
100 105 110 

Tyr Tyr Val Val Arg Gin Leu Ser Trp Glu His Gly His Val Leu Cys 

115 120 125 

Ala Ser Val Asn Tyr Leu Arg Thr Val Ser Leu Tyr Val Ser Thr Asn 
130 135 140 

Ala Leu Leu Ala lie Ala lie Asp Arg Tyr Leu Ala lie Val His Pro 
145 150 155 160 

Leu Lys Pro Arg Met Asn Tyr Gin Thr Ala Ser Phe Leu lie Ala Leu 
165 170 175 

Val Trp Met Val Ser lie Leu lie Ala lie Pro Ser Ala Tyr Phe Ala 
180 185 190 

Thr Glu Thr Val Leu Phe He Val Lys Ser Gin Glu Lys lie Phe Cys 
195 200 205 

Gly Gin He Trp Pro Val Asp Gin Gin Leu Tyr Tyr Lys Ser Tyr Phe 
210 215 220 

Leu Phe He Phe Gly Val Glu Phe Val Gly Pro Val Val Thr Met Thr 

225 230 235 2 40 

Leu Cys Tyr Ala Arg He Ser Arg Glu Leu Trp Phe Lys Ala Val Pro 

245 250 255 

Gly Phe Gin Thr Glu Gin lie Arg Lys Arg Leu Arg Cys Arg Arg Lys 
260 265 270 

Thr Val Leu Val Leu Met Cys He Leu Thr Ala Tyr Val Leu Cys Trp 
275 280 285 

Ala Pro Phe Tyr Gly Phe Thr He Val Arg Asp Phe Phe Pro Thr Val 

290 295 300 

Phe Val Lys Glu Lys His Tyr Leu Thr Ala Phe Tyr Val Val Glu Cys 
305 310 315 320 



Page 3 



\\ () 0 1 3<>-Tt 



P( I I sou 3150*> 



I 1 o Ala Ser A: i. r M<-t Ilo As:i Thi Vn 1 Cys f; . t.- V.jI Tm V:.] 

Lye Asn Asn Thr Mot I ys Ty: Phe Lys Lys Met Mt.-t Lui: Lou H i .v. Trp 
«. 0 0 ' < -1 : < c _ 0 

Arg Pro :!et G 1 : . A : g G y S r Lys S e r S er A 1 a A s p L e u A s p L^ c A r. g 

•. ; : S x. : 3 t L 

Thr Asn Cly Val Pi " Thr Thr Glu Glu Val Asp Cys I le /Arc; Lou Lys 
37 0 " 37 5 ittO 

<210 ■ 3 

<2I1* 1160 

<212> DNA 

<:213> Homo sapiens 

,:i tgcLgqcag c t g r c 1 t r_ g o dqactictaac t c c a g c a g c a t g a a t q t g tr z t t tgct c a c ^ 0 

ctcc.ictr.tg ccggagggta cotgocctct gatr.cccagg actqqaqaac rrat ca t cccg 12 0 

gctcecttcg rgg^tqtctg cctggtgggc ttrgtgggaa acctgtgtqt gattggcatc 180 

ctccttcaca atgcttggaa aggaaagcca tccatgatcc actccctgat tctgaatctc 24 0 

a g c c r g g c t g a:ctctc:c: c c t g c t g t z t tcigcaccta tccgagctac g g c g t a c t c c 3 00 

aaaagtgttt qgqatctagg ctggtttgtc tgcaagtcct ctgactggr. t rate :aca ca 3t'-,0 

tqcatggcag ccaaq-igcc: gaeaategtr gtggtggcca aagtatgett zatqtatgea 420 

agtgacccag ccaaccaayt gaqtatcca: aactacacca tctggtcagt gctggtggcc 4 PC 

at.ctggactg tgg:tag:ci gttacccctg ccggaatggt tctttagcac catcaggcat 54 C 

catgaaggtg tggaaatgt.g cctcgtggat gtaccagctg tgg:tgaaga gtttatgteg 6o0 

atgtttgqta agc:cca:c: aci;ctggca tttggcc:tc cat tattt t t :gc:agcttt h*>0 

t.at.ttctgqa gagrttatga ccaatgtaaa aaacgaggaa ctaagactca aaatottaga 720 

aaoc.jgatac gct:aaagc3 agt:acagtg at get get ga gca:tg:cat latctctqct '-Ho 

■::tcttgtgac tccccgaatg ggtagcttgg ctgtgggtat ggcatctgaa ggct: gcaggc SAO 

•cqgccc:ac cacaaggttt :atagccc:g tetcaagtct tgatgttttc :at:::t:ca 9:)0 

ocaaatcctc tcattvtr.ct tgtgatqrcg gaagagttca gggaaggett gaaaggtqta 0n0 

T cjgaa a t g ca tqataacian aaaacctcca actgtitiag agtctcagga aa :a:cagct 102"' 

jqca^ctc&c aqqgtrttce tga:aaggtt ccatcLccag aat" :c :c:agc a t ::c 3 1 acca IOhO 

gaaaaagaca aaccc jg:'tc tccrtcctct ggcaaagqga aaaetgagaa ggcagaqatt 1 1 40 

,:c-:a:c;ttc ctgaegtaga gragtttrgg catgagaqgg aca:agtc:c ttctqtacag 1200 

gacaatgacc ctatcccctg ggaacatgaa gatcaagaga caggggaagg tgttaaatag I2tj0 

■210> 4 

<211- 419 

<:2I2^ PKT 

< 2 1 3 . » H omo sapiens 

Paae 4 



WO (H/364^1 P(T/rSO«l/3150«> 



' H 0 J > 

.Me: Leu Ala Ma Ala P h e Ala Asp Ser A s n S e r S e r Ser Mot. A s n V a 1 

i 5 ' 10 15 

Ser Phe Ala His Leu His Phe Ala Gly Gly Tyr Leu Pro Ser Asp Ser 

20 25 30 

Gin A.sp Trp Arg Thr lie lie Pro Ala Leu Leu Val Ala Val Cys Leu 
3 5 4 0 4 5 

Val Gly Phe Val Gly Asn Leu Cys Val lie Gly lie Leu Leu His Asn 
50 55 60 

Ala Trp Lys Gly Lys Pro Ser Met lie His Ser Leu lie Leu Asn Leu 

65 7 0 7 5 8 0 

Ser Leu Ala Asp Leu Ser Leu Leu Leu Phe Ser Ala Pro lie Arg Ala 
85 90 95 

Thr Ala Tyr Ser Lys Ser Val Trp Asp Leu Gly Trp Phe Val Cys Lys 

100 105 11C 

Ser Ser Asp Trp Phe lie His Thr Cys Met Ala Ala Lys Ser Leu Thr 
115 120 125 

lie Val Val Val Ala Lys Val Cys Phe Met Tyr Ala Ser Asp Pro Ala 
130 135 140 

Lys Gin Val Ser Tie His Asn Tyr Thr lie Trp Ser Val Leu Val Ala 
145 150 155 160 

He Trp Thr Val Ala Ser Leu Leu Pro Leu Pro Glu Trp Phe Phe Ser 
165 170 175 

Thr He Arq His His Glu Gly Val Glu Met Cys Leu Val Asp Val Pro 
180 185 190 

Ala Val Ala Glu Glu Phe Met Ser Met Phe Gly Lys Leu Tyr Pro Leu 
195 200 ' ' 205 

Leu Ala Phe Gly Leu Pro Leu Phe Phe Ala Ser Phe Tyr Phe Trp Arg 

210 215 220 

Ala Tyr Asp Gin Cys Lys Lys Arg Gly Thr Lys Thr Gin Asn Leu Arg 
225 230 235 240 

Asn Gin He Arg Ser Lys Gin Val Thr Val Met Leu Leu Ser He Ala 
245 250 255 

He He Ser Ala Leu Leu Trp Leu Pro Glu Trp Val Ala Trp Leu Trp 

260 265 270 

Val Trp His Leu Lys Ala Ala Gly Pro Ala Pro Pro Gin Gly Phe He 

275 280 285 

Ala Leu Ser Gin Val Leu Met Phe Ser He Ser Ser Ala Asn Pro Leu 

290 295 300 

He Phe Leu Val Met Ser Glu Glu Phe Arg Glu Gly Leu Lys Gly Val 
305 310 315 320 

Trp Lys Trp Met He Thr Lys Lys Pro Pro Thr Val Ser Glu Ser Gin 

325 330 335 
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WO 0 1 3<>-TI 



IH I I SOI > 3 1 50*) 



;iu : :.r } i ■ 



ily Asn Sor Glu Gly Leu Pro Asp Lys Val Pro Ser 



Pro 



'rc A 1 a Ser _ .1 c- Fro Glu Lys Glu Lys Pro 5er Se: 

3 60 3-0 



Ser Ser Gly Lys Gly Lys Thr Glu Lys Ala Glu lie Prs lie Leu Pro 

37 0 ~ VP- 3H0 

Asp Val Glu Gin Phe Trp His Glu Arg Asp Thr Val Pro Ser Val Gin 

J /db 390 395 400 

Asp Asn Asp Pro lie Pro Trp Glu His Glu Asp Gin Glu T.or Gly Glu 



4 Or 



Uy Val Ly< 



4 10 



Alb 



1 1 -= 



sapiens 



, 1 1 -:e :<\i.i . : a: t ci^tcag ctacgagtat ggggattaca gegacctet 

qt.q.i.r-r.::; t.g;atggcgc etgcctggcc atcgacccgc tgegcgtgq 

or a i -i t -i. «■ : ;\\itci tc:t ggtgggggtg ccgggcaatq coatggtgg 
qgg i <igcr > ; u jqao ggtgggtgcc acctggttgc tccacotgg 

r. tgrtgtqur qrttgictct. goccatcctg gcagtgccca ttqcccgtgg 

■:cgt.;tyr.c ciqtgggctg tcgggcgctg ccctccatca tcctg:oga: 

.iqcutccT.es t:ooqg cage tctcagtgcc gacctctgct tcctggctct 

rgGi ;;aro: ■. - g u . t ragcg qg cgtgcggg qtgcaggtgg cetgtggggc 

ctg ro: t - :: t g"* cssccg: gc:ctccgc: atctaccqcc gqetgeaesa 

ccd'-icccqqc tg s;qtqt~rr ggtgqactac ggcggctcct ccagcaccqa 

dcspoaicr qg*o.tctttt tggcttcctg gggee cc tgg tggccgtgg: 

• tqt "to ; " tgtqctqggc ag:ccqacg-: tgccggccgc ogggcacaeu: 

ggg: i : v r i Lctgctgggc ao setae car :::tgctggggc tggtgctcac 

ccgaact:::; saet.cetggc caggqecetg egggctgaac ceo teat eg t 

clcgcl riC ; i get gee tea a tcccatqctc itcctgtatt ttgggagqgc 

eggtcao q: cage t gee tg teact gggee etgagggagt ceeagqqeea 

qtggaoaqea agaaatccac cagecatgao etggtctcgg agatgqaqgt 



ggaccgccct 
cc cgctccca 
etggqtggct 
egtgqcggat 
a ggccactgg 
catgtatgee 
eggg sotgee 
age ctgga ca 
ggagcact tc 
gaatgoggt g 
eagetgecac 
o atet gtgqtg 
t g t g g c ggee 
gggccttgcc 
teaaetccqc 
gqacguaagt 
g t a g 



60 
120 

1 8 0 

2 4 0 
.3 0 0 

3 6 0 
■' 2 o 

H 

5 An 
60 0 

6 60 



8 4 0 

9 00 
9 60 

.014 



<210> 6 

•Gil • J 3': 

<212> PPT 

< 2 1 3 > H omo sapiens 



Page 6 



WO <H/3(>-Tl 



PC 1 l S00/3 1 50*> 



<400> 6 

Met Gly Asn Asp Se: Val Ser Tyr Glu Tyr G.y Asp Tyr Se: Asp Leu 
1 5 1 0 1 5 

Ser Asp Ara Pro Val Asp Cys Leu Asp Gly Ala Cy^ Leu Ala lie Asp 

20 25 30 

Pro Leu Arg Val Ala Pro Leu Pro Leu Tyr Ala Ala lie Phe Leu Val 
3 5 4 0 4 5 

Gly Val Pro Gly Asn Ala Met Val Ala Trp Val Ala Gly Lys Val Ala 
50 55 60 

Arg Arg Arg Val Gly Ala Thr Trp Leu Leu His Leu Ala Val Ala Asp 

65 70 75 80 

Leu Leu Cys Cys Leu Ser Leu Pro lie Leu Ala Val Pro lie Ala Arg 

35 90 95 

Giy Gly His Trp Pro Tyr Gly Ala Val Gly Cys Arg Ala Leu Pro Ser 
100 105 HO 

lie lie Leu Leu Thr Met Tyr Ala Ser Val Leu Leu Leu Ala Ala Leu 
115 120 125 

Ser Ala Asp Leu Cys Phe Leu Ala Leu Gly Pro Ala Trp Trp Ser Thr 
130 135 140 

Val Gin Arg Ala Cvs Gly Val Gin Val Ala Cys Gly Ala Ala Trp Thr 
145 ^ 150 155 160 

Leu Ala Leu Leu Leu Thr Val Pro Ser Ala He Tyr Arg Arg Leu His 
lb5 170 175 

Gin Glu His Phe Pro Ala Arg Leu Gin Cys Val Val Asp Tyr Gly Gly 
180 185 190 

Ser Ser Ser Thr Glu Asn Ala Val Thr Ala He Arg Phe Leu Phe Gly 

inn one; 

li?3 <L<UKJ 

Phe Leu Gly Pro Leu Val Ala Val Ala Ser Cys His Ser Ala Leu Leu 
210 215 220 

Cys Trp Ala Ala Arg Arg Cys Arg Pro Leu Glv Thr Ala He Val Val 
225 230 235 240 

Gly Phe Phe Val Cys Trp Ala Pro Tyr His Leu Leu Gly Leu Val Leu 

245 250 255 

Thr Val Ala Ala Pro Asn Ser Ala Leu Leu Ala Arg Ala Leu Arg Ala 
260 265 270 

Glu Pro Leu He Val Gly Leu Ala Leu Ala His Ser Cys Leu Asn Pro 
275 280 285 

Met Leu Phe Leu Tyr Phe Gly Arg Ala Gin Leu Arg Arg Ser Leu Pro 
290 295 300 

Ala Ala Cys His Trp Ala Leu Arg Glu Ser Gin Gly Gin Asp Glu Ser 

305 310 315 320 

Val Asp Ser Lys Lys Ser Thr Ser His Asp Leu Val Ser Glu Met Glu 

325 330 335 
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w o 01 j(».r i 



P( I I son 3150*) 



dn; 



.itgttqtcrt c 
:ccr.q:aaqg 
■jaact teat a 
t t c t ■:: tccct 
g g g - ctgc c 
? t g 4 .-i g t t t a 
..i jgegg :cc: 
ctga t cage a 
gqggc tgctg 
g t c t r. c ctca 
ctg agecqtg 
ctg c t ortca 
agggt gqgea 
t.tct - - :tgc 
aaccg t gg to 

grgg -cgtCt 

gctt tctgqc 
ctgge :tr 
aact: r :t ce 
ag eg a eg aga 
geggaggeca 
t cccaggqct 



a e g t : :] g t g g 
c t. a g a n g a c v. 
a c c t g a g e t c 

accccii c etc 
t;:-:^gc:t 

a c c t.;j:c:ct 
c :t.g-:daaqt 
eagceat cgc 
-tLe-gtggg 
a -gggeacct 
z-iaa ] :cctc 
:.jc: gee get 
tgqgcgggca 
a :: ( ic :arcig 
tqt c -gectg 
c ^ t a octcaa 
accagagceg 
g j t c r.acca 
t agggaage t 
ga 



c c a g c t g a t a 
jcaca a c e 1 1 
tccatctcce 
tgetccetct 
ttectegct g 
gg t ggg gaa: 
acggtgtt r 
ccgcgtggae 
caacctcttc 
sctcaa:cgc 
ggcagctge:: 
g c t c c t g a g e 
ggcetcget z 
:a:cct:t:.t 
ggeaggcccg 
: 1 1 c 1 1 3 e e - 
rcgatcce t 7 
:agtgt zetg 
ggecttgetg 
i c e c t c e a g g 
laaag tgcag 



et 



e t 



g t g ecaa t ca 
ct::t;eaqqcc 
t c t e t c t e ■: 
gcclti: act 
g t g r c 1 1 z <: 
a q 1 1 1 g j :: e c 
ctg qtcagee 
t a 3 1 a c c t c c 
a t gc t g tcca 
tac:tgaagq 
egggtggccg 
a c c 1 1 c f e c g 
egergg-a cc 
g c t a 1 1 g t g a 
eag agggc ca 
a g e a tcatc: 
g a e c t c t g c a 
gaccccgtgc 
ggeet caegc 
e a g t g g c q c t 
gqegagg-rct 



t ecca ::t:;j 
e a t g g e c e a a 
c:tctgtrct 
ctgtgggggg 
t ect ggcace 

t g g t j 3 c g 

t ;caigaqa: 

c c a a e r g c a e 

tggtgcagc: 

ggggaete tg 

gccc::c:tg 

aigcact^tH 

geat rgggct 

tgcgtgtqct 

t tqgeatqg e 

cacaqctct: 

tctactg::: 

ggggccggoa 

acegggagge 

c t c t gga aaa 



eeetgagcac 
gcaq e e ^ a t j 
eectc eetec 
gtcctctgga 
a a t. c c l g g e c 
::t.qca:-:: a r 
t gactt =ct c 
etggcgeL 1 1 
ggccagcgtt 
c c a c c a c g t g 
qgtgggcat e 
■::tcagcta: 
rrtgciggag 
caccarccqg 
ggccatggtg 
rtccatgg*. g 
2 ratggcte 2 
z totag ccc z 
gggcccagt 5 
.rtctagqaag 
gqaagqetc; 



1 y 'j 
3 0 0 



'j H 0 
r >-10 
6 0 0 

6 oO 

7 K 0 
8-1 0 
9 00 
9 ^.0 

1 02 0 

I 08 0 

II AO 
1200 
1260 



^ I 0 > 8 

•:21L.> 423 

*:2 12> PRT 

• ' 2 I 3 > H or. o 3 a p iens 

*:4 00> ft 

Met Leu Gys His Arq Gly Gly Gin Leu lie Val Pro lie lie Pro Leu 
1 S 1 15 

Cys Pro Glu His Ser Cys Arg Gly Arg Arg Leu Gin Asn Leu Leu Ser 

Page S 



wo oi 3<>-ri 



P( "17 1' SI Ml/3150 4 ) 



20 25 30 

Gi v Pro Trp Pro Lys Gin Pro Met Glu Leu His; Asn Leu Ser Ser Pro 
"* 3 5 4 0 4 5 

Ser Pro Ser Leu Ser Ser Ser Val Leu Pro Pro Ser Phe Ser Pro Ser 

hC 5 5 60 

Pro Ser Ser Ala Pro Ser Ala Phe Thr Thr Val Gly Gly Ser Ser Gly 
65 70 75 80 

Glv Pro Cys His Pro Thr Ser Ser Ser Leu Val Ser Ala Phe Leu Ala 
85 90 95 

Pro lie Leu Ala Leu Glu Phe Val Leu Gly Leu Val Gly Asn Ser Leu 
100 105 110 

Ala Leu Phe lie Phe Cys lie His Thr Arg Pro Trp Thr Ser Asn Thr 
115 120 125 

Va i Phe Leu Val Ser Leu Val Ala Ala Asp Phe Leu Leu lie Ser Asn 

1 < - 13 5 14 0 

Lou Ft; Lou Arg Val Asp Tyr Tyr Leu Leu His Glu Thr Trp Arg Phe 
145 150 155 160 

GJy ALi Ala Ala Cys Lys Val Asn Leu Phe Met Leu Ser Thr Asn Arg 
165 170 175 

Tr.r /-. w: G-T Val Val Phe Leu Thr Ala lie Ala Leu Asn Arg Tyr Leu 
13C 185 190 

Lys VH v-. I Gin Pro His His Val Leu Ser Arc Ala Ser Val Gly Ala 

195 200 " 205 

Ala Ai C ; Aro Val Ala Gly Gly Leu Trp Val Gly lie Leu Leu Leu Asn 

21o 215 220 

Gly Hi: Lou Leu Leu Ser Thr Phe Ser Gly Pro Ser Cys Leu Ser Tyr 
22 5 230 23 5 240 

Arg Val Gly Thr Lys Pro Ser Ala Ser Leu Arg Trp His Gin Ala Leu 
245 250 - 255 

Tyr Lou Leu Glu Phe Phe Leu Pro Leu Ala Leu He Leu Phe Ala lie 
260 265 270 

Val Ser lie Gly Leu Thr He Arg Asn Arg Gly Leu Gly Gly Gin Ala 
275 280 285 

Gly Pro Gin Arg Ala Met Arg Val Leu Ala Met Val Val Ala Val Tyr 

290 295 300 

Thr He Cys Phe Leu Pro Ser He He Phe Gly Met Ala Ser Met Val 

305 ' 310 315 320 

Ala Phe Trp Leu Ser Ala Cys Arg Ser Leu Asp Leu Cys Thr Gin Leu 
325 330 335 

Phe His Gly Ser Leu Ala Phe Thr Tyr Leu Asn Ser Val Leu Asp Pro 
340 345 350 

Val Leu Tyr Cys Phe Ser Ser Pro Asn Phe Leu His Gin Ser Arg Ala 

355 360 365 
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WO 01 3(.4~l 



l*( I I MM) 315t>«> 



L~u Leo GLy Lou Tar Arq GJ y Arg G.ln ' Uv P:o V\ : I Sei Asp Glu S~ : 



Gl n Trn Arq 



/ r r .: 

3 



Glu A: a Tie Giy Lys Lou Lyc Va J • i :. C-Jy 



4 10 



L\y Glu G;y Ser Ser by: (_■ * y 
4 2 0 

■ 2 i 0 • 9 
<2l 1 * ^66 
-.212 > DMA 
2 1 3 * H orno s a p l o n s 



2 0 



t y.qd.icc f iga ctttgaataq yagtqqgacc gtqgagt cag 
aqca^agtqc acacqq^cr.a cct.ggtgctg agetccctgg 



yggatggcag gcaayaqcat ggtgatctgg 

tictcjcatct a tat cc tea a 

tccargctca gectggaaac 

a tqaagaqac t^atqtacLt 

a o.:caqcq:t gtctctetgt 

y z g t c a g c ■;: t g gg t g r. g t g g 

t. c 1 1 c c 1 t c t qcagcaagtt 

y t. e caggc ::c crctcatcat 

::ctttg T .;:t gqgtqcggag 

' : - 0 g r gg ? - - c tggcctcLqt 

: qqt * Z g ' . qc tctactggtt 

t c.j cqcctct cytcgtccgt 

< : g r cqgagy a geca caqq ct 



:. g::tgggct 
r-ctggeggea gccgacetcc 
:cagcccctg gtcaacacca 
tg:ctacaca g t g g q c c t g a 
:ctcttccct acctggttca 
jztgctgtgg acacrytqtc 
:t:gaaattc a a t. gaaga t c 
gggggtctta accc:agtga 
qa 3d. cccag cagtqgcggc 
"tggtgttc ctcaictgtt 
q a g c c t g c c g c c eg agar g c 
aagcagcagc g e c a a c c c c g 



gcccaccagg 



y.gggga 



lu Ala Ser Arc ly:. 



o lu Val Ser Lea 2 u 
415 



^gyg.-i gg age engage tgg a aggtggggag af:qccca>: 



cccta a a c t a 
ccatgt' tcac 
ttcgaatgea 
tcttcctctt 
ctgacaaggt 
qcctgctqac 
agtqtcaccg 
tcctgargaa 
ggtgcttcag 
:.gac::rgtc 
g g c a g e ccac 
ccctg:ctcr. 
aggtc:tgtg 
tcatc :a ctt 
ctgigctc era 
t g g g c a c c a a 



Z t c :: a g a g g c 
ct gec: t g t.gc 
caggaacccc 
cagcatggct 
cca ;gag.:tg 
ggeca t cage 
gcccaggcac 
egggt tgacc 
ggtggacatg 
cage., tgacc 
a c g g e tqt.t c 
gage a tc t ac 
cttcagctig 
cctggt gggc 
acaggegett 
tgagatgqgg 



1 2 0 

2 4 0 

3 0 0 
2 6 0 

4 20 

4 80 

5 4 0 
*j0 0 
0 » ,y 
V 2' 'j 
780 

h 4 o 

0 0 0 
0 60 
0 6 6 



■ ^ : 0 • 1 0 

■ 21] • 321 
■21c- P~;T 

• 2 J 0 • H j mo sapiens 

-400- 1 0 

Met Ash Gin Thr Leu Asn Ser Ser Gly Thr VaJ Glu Ser Ala Lou Asn 



Tyr Ser Arq Gly Ser Thr Val His Thr Ala Tyr Leu Val Leu Ser Ser 



2 0 



25 



30 



Paqe 10 



wo m/3<>-m 



PC I l SOO/3 1 50V 



i.eu Ala Met Phe Thi Cys Leu Cvs Giy Met Ala Gly Asn Ser Met Va 1 
35 40 45 

He Trp Leu Leu Gly Phe Ara Met His Arg Asn Pro Phe Cys He Tyr 
50* 55" 60 

He Leu Asn Leu Ala Ala Ala Asp Leu Leu Phe Leu Phe Ser Met Ala 

65 70 * 75 80 

Ser Thr Leu Ser Leu Glu Thr Gin Pro Leu Val Asn Thr Thr Asp Lys 

85 90 95 

Val His Glu Leu Met Lys Arg Leu Met Tyr Phe Ala Tyr Thr Val Gly 
100 105 110 

Leu Ser Leu Leu Thr Ala He Ser Thr Gin Arg Cys Leu Ser Val Leu 
115 120 125 

Phe Pro He Trp Phe Lys Cys His Arg Pro Arg His Leu Ser Ala Trp 
130 135 " 140 

Val Cys Gly Leu Leu Trp Thr Leu Cys Leu Leu Met Asn Gly Leu Thr 

145 150 155 " 160 

Ser Ser Phe Cys Ser Lys Phe Leu Lys Phe Asn Glu Asp Arg Cys Phe 
165 170 175 

Arg Val Asp Met Val Gin Ala Ala Leu lie Met Gly Val Leu Thr Pro 
180 185 190 

Val Met Thr Leu Ser Ser Leu Thr Leu Phe Val Trp Val Arg Arg Ser 

195 200 205 

Ser Gin Gin Trp Arg Arg Gin Pro Thr Arg Leu Phe Val Val Val Leu 

HO 215 220 

Ala Ger Val Leu Val Phe Leu He Cys Ser Leu Pro Leu Ser He Tyr 

225 230 235 240 

Trp Phe Val Leu Tyr Trp Leu Ser Leu Pro Pro Glu Met Gin Val Leu 
245 250 255 

Cys Phe Ser Leu Ser Arg Leu Ser Ser Ser Val Ser Ser Ser Ala Asn 
260 265 270 

Pro Val He Tyr Phe Leu Val Gly Ser Arg Arg Ser His Arg Leu Pro 

275 280 285 

Thr Arg Ser Leu Gly Thr Val Leu Gin Gin Ala Leu Arg Glu Glu Pro 
290 295 300 

G_u Leu Glu Gly Gly Glu Thr Pro Thr Val Gly Thr Asn Glu Met Giy 

305 310 315 320 

Ala 



<21Q> 11 

<211> 1356 

<:i!2> DNA 

^213^- Homo sapiens 

<:400> 11 

atggagtcct cacccatccc ccagtcatca gggaactctt ccactttggg gagggtccct 60 
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,\:aciccccac g t r C: : t c t. d c tgccaqtggg ..: t , . q i i J j t gqqgc *:. acq 'T}jt u: '. j :* l.'c 

t:::-|^arctq tqgddett CI Z -r. at a c Z cractcqact t g a ct q ct g t g g c t g g c a a t 1 - C 

qccqcr.ytaa t g g c r g t g ^ t cgcca a a acq c c t g c cc:::c: g a a a a 1 1 L g t c t r. c: g t: c *_ t c f: ' -3 0 

;;a.:c:tctgcc tqgtggacct gctgqctqc: ci ^acccf ^ :i t dccot ggc L:utgct:t.;:: 3 00 

aqctctyccc tctt: caeca cgcc:Vctt: gqggaqqtgq :::g:cg:: T : cr jc t:t:;tt r 

clga g c g t g t g c t 1 1 g t c a g c ciggcc.it / ct :tcgqr g t c^j: c a z :aa t. g t g g a c g -;■ 4 2 2 

tactattarg tagtc race:: catgegctao qaggtgcqca tgargctggq qctggtggcc 4 SO 

tctgtgctgg tgggtgtgtg ggicjaaqgcc ctggccatgg rttdgtgcc agtgct. gaga 5-10! 

cj g g j t c t c c t g g g a g g a a g g a get: c c a g r gt :cc: c l: .3 g g c t g t t c 3 c t ccagtgga g c 6 0 0 

ca:agtg:-:t actgoca^ct ttttgtggtg gtcttlgctg ice t t t a r ~. t tctgt tgecc 600 

.; tg;tc-:tca t a c 1 1 g t g g t eta c gca g c a tgtt:c g a g 1 g g c c c 14 c g t y g c t . geca t. g '/ 0 0 

cagcacgggc cg::g:c-:ac gtggatggag acacc;cggc aa;gctc-:ga atctct:agc 7y0 

agccgctcca cgatggtcac cagctegggg gccccccaga ccaccccaca ccggacgttt 8-10 

gggggaggga aagcagcagt ggtt dec tg gctgtggggg gaeagttcci gctctgttgg 900 

tig redact tctctttcca rctctatgtt gcrctgagtg ctcagcceat ticaaciggg c, oJ 

caggt.ggaga gtgtggtcac ctggattggc ta:ttctg:t zcaczzc ;aa ccctt:c*:r.r 102 J 

tatggatgtc tcaaceggea gaiccggggg gage tea gca dqcagt ttqt; ctgctccrtc: 108 0 

iagccagctc cagag<gagga gctgaggctg cctaqccggg agggctccat tgaggagaac 11*10 

rtcetgcagt tedtraggg gactqgctgt jcttctqagt cotgggtttc ccqaccccta 1700 

-:ccagc<::<~;ca ageaggagee acctgergtt ga :tt::qaa tcccaggcca gatagcigaq I2b0 

ga ga cct ct g agttcctgga qcagcaactc a cca g eg a ca tcatcatgt.c aqacagctac 11-20 

z z c c q t c c 1 g ccgcctcacc ccggctqgag tcatqa 13 5 0 

-'2 10"- 12 
-.2 1 1 > 4 SI 
7 1 2 * PRT 
< 2 1 3 > H omo sapiens 

:4 00 12 

Met GIm Set Ser ?ro lie Pro Gin Ser Ser Gly Asn Ser Ser Thr Lei: 
1 ■, 10 15 

Gly Arg 7a 1 Pro Glr: Thr Pro Gly Pro Ser Thr Ala Ser Gly Val Pro 



Glu Val Gly Leu Arg Asp Val Ala Ser Glu Ser Val Ala Leu Phe Pho 
3 5 " 4 0 4 5 

Met Leu Leu Leu Asp Leu Thr Ala Val Ala Gly Asn Ala Ala Val Met. 
50 5 5 60 

Pase 12 



WO 01/304 "M 



P<T/l'S00/3150<> 



Ala Va 1 He Ala Ly:, Thr Pro Ala Leu Aig Lys Phe Val The Val Phe 

6 5 7 0 "'5 8 0 

His Lei Cys Leu Val Asp Leu Leu Ala Ala Leu Thr Leu Met Pro Leu 

8 5 90 9 5 

Ala Met Leu Ser Ser Gei Ala Leu Phe Asp His Ala Leu Phe Gly Glu 
100 105 110 

Val Ala Cys Arq Leu Tyr Leu Phe Leu Ser Val Cys Phe Val Ser Leu 
115 ' 120 125 

Ala lie Leu Ser Val Ser Ala lie Asn Val Glu Arg Tyr Tyr Tyr Val 
130 135 140 

Val His Pre Met Arg Tyr Glu Val Arq Met Thr Leu Gly Leu Val Ala 
145 150 155 160 

Ser Val Leu Val Gly Val Trp Val Lys Ala Leu Ala Met Ala Ser Val 
165 170 175 

Pro Val Leu Gly Arg Val Ser Trp Glu Glu Gly Ala Pro Ser Val Pro 
180 185 190 

Pro Gly Cys Ser Leu Gin Trp Ser His Ser Ala Tyr Cys Gin Leu Phe 
195 200 205 

Val Val Val Phe Ala Val Leu Tyr Phe Leu Leu Pro Leu Leu Leu lie 
210 215 220 

Leu Val Val Tyr Cys Ser Met Phe Arq Val Ala Arg Val Ala Ala Met 
225 230 " 235 240 

Gin His Gly Pro Leu Pro Thr Trp Met Glu Thr Pro Arg Gin Ara Ser 
245 250 255 

Glu Ser Leu Ser Ser Arg Ser Thr Met Val Thr Ser Ser Gly Ala Pro 
260 265 270 

Gin Thr Thr Pro His Arg Thr Phe Gly Gly Gly Lys Ala Ala Val Val 
275 280 285 

Leu Leu Ala Val Gly Gly Gin Phe Leu Leu Cys Trp Leu Pro Tyr Phe 
290 295 300 

Ser Phe His Leu Tyr Val Ala Leu Ser Ala Gin Pro lie Ser Thr Gly 
305 310 315 320 

Gin Val Glu Ser Val Val Thr Trp lie Gly Tyr Phe Cys Phe Thr Ser 
325 330 335 

Asn Pro Phe Phe Tyr Gly Cys Leu Asn Arg Gin lie Arg Gly Glu Leu 

340 345 350 

Ser Lys Gin Phe Val Cys Phe Phe Lys Pro Ala Pre Glu Glu Glu Leu 

355 360 365 

Arg Leu Pro Ser Arg Glu Gly Ser lie Glu Glu Asn Phe Leu Gin Phe 

370 375 380 

Leu Gin Gly Thr Gly Cys Pro Ser Glu Ser Trp Val Ser Arg Pre- Leu 
385 390 395 400 

Pro Ser Pro Lys Gin Glu Pro Pro Ala Val Asp Phe Arg lie Pro Gly 
4 05 4 10 4 15 
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c J 1 r i lie Ala Glu l.;.Lu Th: Se: GLu The L-u Glu Gin Gin Leu Ti.r 

42^ •;. •. 4 3 0 



Leu Glu Ser 
■3 50 

• : : : • i 3 

^211- 10 A 1 

-.212 • DMA 

■;213' Homo sapiens 

<400 • 13 

»a tgq. t gaga a a at. ttat.gtc cttqcaacca t.ccatctccg tatcagaaat gqaaccaaar »50 

i^ r jiS.:cttca gcaataaraa c a g c a q g a a c tgcaeaattg a a a a c 1 1. z a a qagag.iattt 12 0 

rtcc::aattg tar at ct.gat aatat:tt:-:: tggngagtct tqqqaaatqc :ttqtL:ca:d 1H0 

•.atijttttcc tgcagcctta taagaagtcc acatctgtga acgttttcat gctaaatctq 24 0 

gccatttcag atctcctgtt cataagcacq cttcccttca gggctgacta ctarcttaga 300 

ggctccaatt qgatattrgg agac:tggcc tgcaggatta tgtcttattc rttgtatgtc 360 

aacatgtaca qcagtattta tttcctgacc gtgctgagtq ttgtqcgttt :ctgq:aatg 4 2 0 

gvtcacccct ttcgqcttct g:atgtcacc agcatcagga gtgcctggat rctctgtggg 

arcatatgga tccttatca: g get tec tea ataatgetec tggacagtgg ctctgagcag 54 0 

aacqgeagtq rcacatcatq cltagagctg aa t ct eta t a aaatt get a a get.qcagacc *5<>0 

a *;ga aetata ttgccttggt ggtgggerge ctgctgccat ttttcacact eaqeatctgt 6 60 

tatcrgctga tcattcqgqt t rtgttaaaa grggaggtcc cagaateggg getgegggtt ~ ? 2 0 

tcti.Mcagga aggcact gac cacc3tca:c atcacettga tcatcttctt Tttgtgtttc 7 i"- 

ctgecetatc acacactgag gaccgtccac ttgacgacat ggaaagtggg tttatgcaaa B4 0 

gaeanactge ataaaqcttt ggttatcaca ctggccttgg cagcagccaa tgectgette 90 O 

aatcctctqc tctattactt tgctggggag aattttaagg acaqactaaa ctctqcact.r 9'>0 

agaaaaggee atccacagaa ggcaaagaca aagtgtqttt tccctgttag tqtgtggttg 102 0 

aqaaaggaaa caagagtataa 1041 

-g:iq- 14 

<21 : 34C 
<2 12- PRT 
< 2 1 3 • H omc sapiens 

--4 CO - 14 

Met Glu Arg Lys Phe Met Ser Leu Glr Pro Ser lie Ser Val Ser Glu 
1 3 10 1 : 

Met Glu Fro Asn Giy Thr Phe Ser Asn Asn Asn Ser Arg Asn Cys Thr 

20 25 30 

Page 1 4 
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T J. g- G 1 u Asn Phe Ly s Arc Glu Ph e Phe Pro He V a 1 T y r Leu lie He 
3 5 '1 0 A 5 

Phe D he Trp Giy Va: Leu G)y Asm Gly Leu Ser He Tyr Vai. Phe .eu 
'3 0 5 5~ 60 

Gin Pro Tyr Lys Lys Ser Thr Ser Val Asn Vai Pne Met Lea Asn Leu 

65 70 75 BO 

Ala He Ser Asp Leu Leu Phe lie Ser Thr Leu Pro Phe Arg Aia Asp 

85 90 95 

Tyr Tyr Leu Arq Gly Ser Asn Trp He Phe Gly Asp Leu Ala Cys Arg 
100 105 110 

He Met Ser Tyr Ser Leu Tyr Val Asn Met Tyr Ser Ser He Tyr Phe 
115 " 1 2 0 125 

Leu Thr Val Leu Ser Val Val Arg Phe Leu Ala Met Val His Pro Phe 
130 135 140 

Arg Leu Leu His Val Thr Ser lie Arg Ser Ala Trp He Leu Cys Gly 
1*15 150 155 160 

lie He Trp He Leu He Met Ala Ser Ser He Met Leu Leu Asp Ser 
165 170 175 

Gly Ser Glu Gin Asn Gly Ser Val Thr Ser Cys Leu Glu Leu Asn Leu 
180 185 190 

Tyr Lys He Ala Lys Leu Gin Thr Met Asn Tyr lie Ala Leu Val Val 
195 200 305 

Giy Cys Leu Leu Pro Phe Phe Thr Leu Ser He Cys Tyr Leu Leu He 

::io 215 220 

lie Arg Val Leu Leu Lys Val Glu Val Pre Glu Ser Gly Leu Arg Val 
235 230 235 240 

Ser His Arg Lys Ala Leu Thr Thr He He lie Thr Leu He He Phe 

24^ 250 255 

Phe Leu Cys Phe Leu Pro Tyr His Thr Leu Arq Thr Vai His Leu Thr 
260 265 270 

Thr Trp Lys Val Gly Leu Cys Lys Asp Arg Leu His Lys Ala Leu Val 
275 280 2P5 

He Thr Leu Ala Leu Ala Ala Ala Asn Ala Cys Phe Asn Pro Leu Leu 

290 29^ 300 

Tyr Tyr Phe Ala Gly Glu Asn Phe Lys Asp Arg Leu Lys Ser Ala Leu 

:"i05 310 315 320 

Arg Lys Gly His Pro Gin Lys Ala Lys Thr Lys Cys Val Phe Pro Val 
325 330 335 

Ser Val Trp Leu Arg Lys Glu Thr Arg Val 
340 345 

<::10> 15 

<H1> 152 7 

«:J12> DNA 

<213> Homo sapiens 
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- .. 4 0 0 > 1 5 
. : r i • ; - i • .• : \; 

■;t::C'.: a a a a 

t tcctcqccq 

cage tact ;:c 

aCLtcq-t 

ag jcact t 
a ccatt-^tcg 

c t qcagag ~:a 

ritcqtcattx 
aggcagcatg 
tgtgtggaqa 
gag tttcgee 
ga -ggcagee 
aggggcagcg 
aaggaagg :a 
qccaac :aqt 
aat ttcagtg 
eg taacagca 
atcttcs: za 
ctggcc gtgt 
r.qgctt:::r. 
at tii a 13 aagg 
qaagata gee 
rcc:acqa:t 



tgcc;.:t caq 
cc r ct. r* cat 
a gat qac raa 
taqc:':crtq 
cjcacggcc: r. 
tgqtgr.La gt 
cccagcqccg 
ctcctcca c t 
tgat -tqggg 
cactgartqt 
ct ctgctgt a 
a t ga ggatga 
geeagcatga 
t.gaagqccaa 
aggagg t cag 
qcaccaaagt. 
gcaqca t tga 
agga tgacgt 
acagcaac c c 

gggtgqatqt 
tc:t g e 3 g t g 
aaat:o a g g a 
ac:ca g a c c t 
crqctactU 



:ctqqc cca e 
r g g c a a c a t a 
;cn:tttatc 
g gtqqt gg.:c 
qgt tagee :c 
gga tegctac 
egg tt; a a ctg 
z t aeggctqq 
ggccagcccc 
cat gat tgee 
caa tgtc aag 
agagggagca 
aggtgaggtc 
ggaaggaagc 
agagag cage 
t g a g g a g a a e 
c t tggg tga a 
egagqeagtg 
Lcctctqccc 
era c gtgeta 
cgaaa::cag 
c:g-:at:cac 
catg:tgaag 
gecc ggaaca 
::cttga 



g a g a q t a a e a 
qqca t era t cc 
gtgctggc g c 
t rt,idc..t:c 
acci crgtgc 
accca^: c z g t 
ttgtccacca 
c:cct«::atg 
ggccaggctg 
agc:a :acta 
tgc laciccg 
agacacagc: 
g a g a a g a a g g 
a a g g c c a a g q 
acgqggacca 
acggtggcca 
agcatgaa gg 
ga t gaca t gg 
aacat:ccgg 
a g g t g ::acc 
tccctgggg: 
or accccag t: 
ccctatq t ct 
aagttrttct 
gaggqtggga 



a:aq:'cacar 
act c a a c :: g t 
t ag tqttg :a 
teg r ■; -i c r g n 
' r : r r. t g 
t egcett :gc 
tc za ccctct 
g ;ac :tggat 
c z t 1 1 g a t g a 
1 1 ct rag zgt 
tggtgttctg 
tqgaagtgcg 
aggagtt ;ca 
agggcagaa : 
gt gag agt aq 
g c g a c g g j a g 
ca g a eaag gg 
agtt tggtga 
agag:ct:cc 
a g t g c a a a g c 
cctactg _tt 
gggtgat zac. 
a t g g c t a :: a t 
gcaa ggaaaa 
c t gaaqgcaa 



qcg 
c c t 
g :c 
c a g 
c t c 

g q 

tq.: 



gt qcatqccc 
q c t g q 1 1 a t c 
caagc rg 
ctgcag 
:tcaa c 
gt caac 
t a c e r q 
gqecate 
caat g 
gtccttc 
agecegg 
a a g g a c 
ggatgagagt 
ggaagee aag 
t qcagag gee 
catggagggt 
t :: q :acaga g 
a g a z g a c a t c 
ac^agtcgt 
t q :t a aag t g 
1 1 cagca gt c 
c a t a a t c a t c 
g c a caagacc 
g::c:cgaaa 
gattgtccct 



1 JO 

1 o 0 

2 -i 0 

3 0C 

3 60 

4 J 0 

A 'r 0 



7 c 1.1 

e ■; 0 

9 On 

10 JO 

10*0 

1 1 0 
uoo 



1 3 -< 1 ) 

1 4 A 0 
15 0 0 
15J^ 



■-■210 > 16 

• J ] I > 508 

1 J > PR T 

■ ?13 ' H d^.o sapiens 

< A 0 O > 1 6 

Met Tnr Set Thr Zy$ Thr Asn 

1 c' 



Scr Thr Arq Glu Ser Asn 3er r.er Hi; 
I 0 lb 
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Thr Cys Mot Pro Leu Ser Lys Met Fro He Ser Lou Ala His Gly lie 

2 0 I: ; 5 3 0 

He Arg Ser Thr Vai Leu Val lie Phe Leu Ala Ala Ser Phe Val Gly 

.}. 5 4 0 4 b 

Asn He Veil Leu Ala Leu Val Leu Gin Arg Lys Pro Gin Lou Leu Gin 

5C 55 60 

Val Thr Asn Arq Phe He Phe Asn Leu Leu Val Thr Asp Leu Leu Gin 

65 " 70 75 80 

He Ser Leu Val Ala Pro Trp Val Val Ala Thr Ser Val Pre Leu Phe 
85 90 95 

Trp Pro Leu Asn Ser His Phe Cys Thr Ala Leu Val Ser Leu Thr His 

100 105 110 

Leu Phe Ala Phe Ala Ser Val Asn Thr He Vai Val Val Ser Val Asp 
H 5 12 0 12 5 

Arg Tyr Leu Ser lie lie His Pro Leu Ser Tyr Pro Ser Lys Met Thr 
130 135 140 

Gin Arg Arg Gly Tyr Leu Leu Leu Tyr Gly Thr Trp He Val Ala He 
145 150 155 160 

Leu Gin Ser Thr Pro Pro Leu Tyr Gly Trp Gly Glr. Ala Ala Phe Asp 
165 17C 175 

Glu Arg Asn Ala Leu Cys Ser Met He Trp Gly Ala Ser Pro Ser Tyr 
180 185 190 

Thr He Leu Ser Val Val Ser Phe He Val He Pre. Leu He Val Met 
195 200 205 

lie Ala Cys Tyr Ser Val Vai Phe Cys Ala Ala Arq Arg Gin His Ala 
210 215 220 

Leu Leu Tyr Asn Vai Lys Arg His Ser Leu Glu Val Arg Val Lys Asp 

225 230 235 240 

Cys Val Glu Asn Glu Asp Glu Glu Gly Ala Glu Lys Lys Glu Glu Phe 
245 250 255 

Gin Asp Glu Ser Glu Phe Arg Arg Gin His Glu Gly Glu Val Lys Ala 
260 265 270 

Lys Glu Gly Arg Met Glu Ala Lys Asp Gly Ser Leu Lys Ala Lys Glu 

275 280 285 

Gly Ser Thr Gly Thr Ser Glu Ser Ser Val Glu Ala Arg Gly Ser Glu 

290 295 300 

Glu Val Arg Glu Ser Ser Thr Val Ala Ser Asp Gly Ser Met Glu Gly 
305 310 315 320 

Lys Glu Gly Ser Thr Lys Val Glu Glu Asn Ser Met Lys Ala Asp Lys 
325 330 335 

Gly Arg Thr Glu Val Asn Gin Cys Ser lie Asp Leu Gly Glu Asp Asp 
340 345 350 

Met Glu Phe Gly Glu Asp Asp He Asn Phe Ser Glu Asp Asp Val Glu 

355 360 365 
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.A Ij; V,;l A s r: I I 'j Pro C 1 u Ser Le u P r o P r o 0 - r A rg ^.ra As i : S e r A s n 

.T- ■" 3 7 5 i&i 

Ser A:;n Pro Pro Leu Pro Arg Cys Tyr Gin Cy^ Lyr Ala Ala Ly :. Va L 
;>8r 3 90 3^3. 40 3 

lie t'no lie lie lie Phe Sex Tyr Val Leu Ser Leu Glv Pro Tyr Oy.s 

•10 3) : j i o 

Phe Lea Ala Va 1 Leu Ala Val Trp Val Asp Vnl Glu Thr Gin Val Pro 
>).■> 4 2 5 430 

31 n Tvp Val He Thr He He lie Trp Leu Phe Phe Leu Gin Cys Cys 
4 3 5 440 445 

lie K i -j Pro Tyr Val Tyr Gly Tyr Met Hi:; Lys Thr He Lys Lys Glu 
4 50 4 55 4 GO 

I :.e 31;. Aar Mo t Leu Lys Lys Phe Phe Cys Lys Glu Lys Pro ?r;) Ly^; 
1(7 4 70 4 75 4 8(3 

"lu A;q . • : Hi.o Pre Asp Leu Pro Gly Thr Glu Gly Gly Thr Glu Gly 
4 3 5 4 9 0 4 9 5 

Ly.s lie Val Pro Ser Tyr Asp Ser Ala Thr Phe Pro 
->0Q 50 5 

-:■]'■■ l ■ 

.Hi- l i " r 

■Li. - r so 

<■ 2 1 ; ■ r. . in- .'-.a prions 

,:U|-" r " vi.j j ; : :)ic it r.tc:tcattt gaqgacctct tggctaacaa tatrctcaga 60 

at at t : :j: or. ^-ir l itagc tttcattacc :.gc r t t ggaa atctntttgt cattggca:g 120 

aqav 1 : t::: r r , j , i a go t ga aaatacaact caegctatqt ccatcaaaat cctr.tqtt.gc 180 

: .- 1 a.!** ; Tq^rg^gtgL ttacttgttc tttgttggca ttt toga tat aaaatacoga 24 0 

gggca;:: at ; a q a aa t a tgc cttgctgtgg atggagagcg tgcagtgccg cetcatgggg 300 

t tec t ggr ra tqet.qtccac cgaagtctct gttctgetae tgacctactt gactttggag 3 60 

aaqrtcctqq t. cattqtctt ccccttcagt aacattcgac ctggaaaacg gcagacctca 4 20 

q teat ■.-.tea t.ttocatctg gatggcggga tttttaitag ctgtaattcc attt.tggaat 4 S3 

1 a ggat *..::*' * :acaaactt ttatggg a a a aatqgagtat gtttcccact ttatt at qac 3 4 3 
c a a a c a ■ 1 1 1 n g a t a r t q g a a g c a a a g gg t a t tctcttqgaa 1 1 1 1 c c t a gg t g t g a a c 1 1 g 6 0 0 

2 t g g c t ~ t c v;arc5ttgt. q 1 1 1 1 c c t a t attactatgt tctgttccat t.caaa a dace 66 3 
gccttgcag :i ocacaqaagt aaggaattgt tttggaagaq aggtggctgt tgcaaatcgt 723 

I I c 1. 1 1 r 1 1 a tagtqttctc Lgatgccatc t g c t g g a 1 1 c ctgtatttgt a g 1 1 a a a a t c 7 8 0 

-e 1 1 1 c: c c t c r. tccgggtgga lataccagac a c a a t q a c 1 1 cctggatagt gatttttttc 8 4 C 1 

cttccaqtta aca gtgcttt gaatccaatc ct eta tact c tcacaaccaa cttttttaag 90 0 

gacaacttcu a a cage tact qcacaaacat cagaggaaat caattttcaa aattaaaaaa 960 

Pace 19 
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aaaaqtLtdi ctacatccat tgtgtqgata gaggactccr cttccctqaa acttggggtt 10 2 0 
ttgaacsaaa taacacttgg agacagtata atgaaaccag tttcctag 10 0c 

-:210> 18 



-:2I2> PRT 

• : 2 I 3 > H onio sapiens 

• 4C 0 • 18 

Met Pro Leu Thr Asp Gly lie Ser Ser Phe Glu Asp Leu Leu Ala Asn 

: 5 10 15 

Asn lie Leu Arg lie Phe Val Trp Val lie Ala Phe lie Thr Cys Phe 

20" 25 30 

Gly Asn Leu Phe Val lie Gly Met Arg Ser Phe lie Lys Ala Glu Asn 

35 40 " 45 

Thr Thr His Ala Met Ser lie Lys lie Leu Cys Cys Ala Asp Cys Leu 
.SO 55 60 

Met Gly Val Tyr Leu Phe Phe Val Gly lie Phe Asp lie Lys Tyr Arg 

65 70 75 80 

Gly Gin Tyr Gin Lys Tyr Ala Leu Leu Trp Met Glu Ser Val Gin Cys 
85 90 95 

Arg Leu Met Gly Phe Leu Ala Met Leu Ser Thr Glu Val Ser Val Leu 
100 105 110 

Leu Leu Thr Tyr Leu Thr Leu Glu Lys Phe Leu Val lie Val Phe Pro 
115 120 125 

Phe Ser Asn lie Arg Pro Gly Lys Arg Gin Thr Ser Val lie Leu lie 
130 135 140 

Cys lie Trp Met Ala Gly Phe Leu lie Ala Val lie Pro Phe Trp Asn 
145 150 155 160 

Lys Asp Tyr Phe Gly Asn Phe Tyr Gly Lys Asn Gly Val Cys Phe Pro 
165 170 175 

Leu Tyr Tyr Asp Gin Thr Glu Asp lie Gly Ser Lys Gly Tyr Ser Leu 
180 185 190 

Gly lie Phe Leu Gly Val Asn Leu Leu Ala Phe Leu He He Val Phe 
195 200 205 

Ser Ty r He Thr Met Phe Cys Ser He Gin Lys Thr Ala Leu Gin Thr 
210 215 220 

Thr Glu Val Arg Asn Cys Phe Gly Arg Glu Val Ala Val Ala Asn Arg 
225 230 235 240 

Phe Phe Phe He Val Phe Ser Asp Ala He Cys Trp He Pro Val Phe 
245 250 255 

Val Val Lys He Leu Ser Leu Phe Arg Val Glu He Pro Asp Thr Met 
260 265 270 



Thr Ser Trp He Val He Phe Phe Leu Pro Val Asn Ser Ala Leu Asn 
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Pro lie Leu Tyr Trit Leu Thi Thi A-:. Pi- ie Ph- Lvs Asp Lys Leu Lys 

2' J 0 25: 300 

'Ur. Leu Leu His Lys H^s 2.1:: Aru I.yi 2er : L.- The Lys Ilo 2y. Lyi 
•y 310 " 313 "32 0 

Ly ^ Ser Leu Ser Thr Scr lie V a 1 T r p 11 e Glu Asp 5 e r Ser 2 e 1 Le 

323 330 335 

Lys !.-u lUy Val Leu As- Lys lie Thr Leu Gly Asp Ser lie Me" Lyr 
'340 " 345 " * 35 0 

P r o V ti 1 Ser 
3 5 5 

v 2 I 0 - 1 9 

;2 1 1 • 969 

' 2 1 2 > DMA 

- 2 1 3 > H ott.o sapiens 

4 :0 - 1 

.j::ga:cc'i3 ccarctcaac cttgqacaca gaactgacac csat caacgg aactqaggaq 6 2 

dc:c*;ttg-:t acaagcagac cttgagccts acggtgctga cgtgcatcgt ttcc^tlgrc 120 

qggctgacag gaaacgcagt t gtgctct.gq ctcctggqct gccgcatgcg caggaacqcc 180 

tt ;t'':cal--::t a cat cc tea a ettqqecgra qcagasttcc tcttcctcag cgg:cqccrt. 24 0 

atatattcc: tgttaagctt catcagta: :: ccccatacca tctctaaaat :ct:taLcc: 3 0 0 

gtgatgatjt tttcctactt tgcaggcctg agct. ttrtga gtgccgtgag caccqaqcgc 3 60 

tgcc'.gtc'.q tcctgtqgcc catctggtac cgctgccacc gccccacaca cctgtcaqcg 420 

qtggtgtqtq tcctgctct.q qgccctgtcs ctgctg:;gga gcatcctgga qtggatgtta 4 80 

r gtgqct t tgttcagtgg tgctgattcr. qcttggtqtc aaacatcaga tttcatcaca 54 0 

qtcqogtgq :: tgatttttzt atgtgtggtt ctctgtgggt ccagcctggt cctgctgatc 600 

nggatt:t:c*. gtggatc:cg gaagataccc ctgaccaqgc tgtacgtgac catcctgctc 6 60 

dcagta:tgg tcttcctcct ctgtggcctq ccctttggca ttcagttttt cctattttta 720 

-ggatC'jacq tggacaggga agtcttat tt tgtcatqttc atctagtttc tatttt :cr.c 730 

t::un:ct:a acagcagtgc caac::catc atttacttct tcgtqggctc etttaqqeag 8 4 3; 

cgtoaaaata qgeagaaect gaagctggtt ctccagaqgg etctgeaqqa cqcglctqag 9 00 

qtggatcaag gtggagggca gcttcctgag gaaatcctgg agctqtcggq aaqcaqattg 9t>0 

gag c a q t q a 9 6 ° 



- 2 1 0 
: 2 1 1 
:212 
:213 



PPT 

Heme sapiens 



'4 00:> 20 

Paqe 2 0 
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PCT/l S00/3I509 



Met Asp Pro Thr lie Ser Thr Leu Asp Thr Giu Leu Thr Pro lie Asn 
I 5 10 IS 

GLy Thr Glu Glu Thr Leu Cys Tyr Lvs Gin Thr Leu Ser Leu Thr Val 

20 25 30 

Leu Thr Cys lie Val Ser Leu Val Gly Leu Thr Gly Asn Ala Val Vol 
3b 40 45 

Leu Trp Leu Leu Gly Cys Arg Met Arg Arg Asn Ala Phe Ser lie Tyr 

50 55 60 

lie Leu Asn Leu Ala Ala Ala Asp Phe Leu Phe Leu Ser Gly Arg Leu 

65 70 75 " 80 

lie Tyr Ser Leu Leu Ser Phe lie Ser lie Pro His Thr lie Ser Lvs 
85 90 95 

He Leu Tyr Pro Val Met Met Phe Ser Tyr Phe Ala Gly Leu Ser Phe 

100 105 110 

Leu Ser Ala Val Ser Thr Glu Arg Cys Leu Ser Val Leu Trp Pro He 
..15 120 125 

Trp Tyr Arg Cys His Arg Pro Thr His Leu Ser Ala Val Val Cys Val 
130 135 140 

Leu Leu Trp Ala Leu Ser Leu Leu Arg Ser He Leu Glu Trp Met Leu 

14 5 150 155 160 

Cys Gly Phe Leu Phe Ser Gly Ala Asp Ser Ala Trp Cys Gin Thr Ser 
165 170 175 

Asp Phe He Thr Val Ala Trp Leu He Phe Leu Cys Val Val Leu Cys 
.80 185 190 

Gly Cor Ser Leu Val Leu Leu He Arg He Leu Cys Gly Ser Arg Lvs 
195 2 00 2 05 

J.-1-c i. - '-' ijcu wij. r-Lj_y .lj^ui - _)' * a :ui j_ _l c: utr u -ue L-t i in vai ut;u v ai 

21 0 215 220 

Phe Leu Leu Cys Gly Leu Pro Phe Gly He Gin Phe Phe Leu Phe Leu 

225 230 235 240 

Trp lie His Val Asp Arg Glu Val Leu Phe Cys His Val His Leu VH 
245 250 255 

Ser lie Phe Leu Ser Ala Leu Asn Ser Ser Ala Asn Pro He He Tvr 
260 265 270 

Phe Phe Val Gly Ser Phe Arg Gin Arq Gin Asn Arg Gin Asn Leu Lvs 

275 280 " 285 

Leu Val Leu Gin Arg Ala Leu Gin Asp Ala Ser Glu Val Asp Giu Gly 

290 295 300 

Gly Gly Gin Leu Pro Glu Glu He Leu Glu Leu Ser Gly Ser Arg Leu 
305 310 315 320 

Glu Gin 



:io> 2i 

Lll> 1305 
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-2 12 
^ 1 3 ■ 



LKA 

PL :nc 



■: 4 c ci - l 

-;it c: gag-: .it c 
t tqc tqg ?ct 



..i c l: a c a g t 
a a g a t g 3 a c t 
■^cqct-gtcac 
3 /gtg—cct 
■H'}ctcat :g 
gcgcgtgcc = 
ttcgtggt gc 
cagccaggcg 
ttcgcgcccc 
1 1 rgtc gege 
egqea—gge 
:::qcgccca 
gc gctg=tgt 
tc gtceggge 
jcgatggcca 
:ggc tc-:goc 
;cq qaqgacg 
-r; t crit:a'. c 
.■cgegccc: a 



sa 1 en s 

ictrtci q c -:: c 
ggggtcM caa 

t gtqeegcet 
tect gctggt 
agctggcctq 
tc:t gca g zt 
cccti-g«ig=g 
tcg::gc::t. 
gcggggcictc 
cgcccccggc 
tgccgcqctg 
ctgttacggt 
cgcaggcccc 
gcgcgctgcc 
tcgtggqetg 
ccgegggaga 
acagrgcuct 
gaeagctgeq 
aggaggggee 
ac cat get eg 
gacccct gec 



gt t to tag eg 
gtgrgg cqqc 
^dgctgicc 
ggaactgctg 
g t g :: a g g c a 
tcqg z gcg ::q 

::c:t:g,:cg 
:gc.::gcg:: 
gcacctgcag 
rctggg :g t z 
tgcggctqca 
ecgr grcaag 
zgagctg- ::c 
-tgggaggga 
raa t - cot t c 
gaagcqgctg 
- cogggeca z 
gcggqaaccg 
t tgetcctqc 



ccgccgqc-'j c C'ja.icati*:'.- 
g q q c a a g c a cj r-;:c:.ac-::c: 



: a t c 



retgect c tgggocacec I2u 

gt. oa tec tqg tggtggcgot gg ca qqca a c 1 o 0 

ggrggqcc;:t gggcggq :cc euagegtege 24 0 

ctggcgga:c tgt^cgcgtg ::g 3 gggca og 3 0 0 

ggcgagcccc g.:gcggc:a: gggggacttg 360 

t cegg g ^ g g g g; q cetera: -zzh z e teg: g 4 2 ^ 

gtgcg*-ettc cqca:ga::cg g::gct.g::: 4 3 0 

:tgqcd:tg: tgttggogel g-:::cc:gg:; 5-10 

ctgccqccg: cgccgccgcc aa:gt;c:'..g 600 

tgqecggggg agegtcgetg c-aegggatc 6*50 

gtetacgcgt tctacgaggc cgtcgcgggc 720 

gettgeggee aeetaetetc cgtctqqtqg 7 80 

gcgccctggt cggogag:cc aggr-:.:gag:c 8 4 0 

gtgeagagee tgaagatgag eetgetq^tg 90 0 

taett^gceg cc jggctggc ggccg:;gtgg 9 60 

gaggge^tgt eggeggeget g-qegtggtg 1020 

gt :tacctct tettccaqg: gqgcgictgc 108 J 

ggctctctgt gctgcgcqcc gcagggaggc 1 1 4 0 

taggegctet sccgccaacg 1 1 ggc -cca c 12 00 

-tgqacgagg gcggcttgcg coca-occct 3 260 

g a a h g t g c c t t c t a g 13 0 5 



•:210> 22 

■~211> 4 34 

:212> PP. T 

-21 3> Homo sapiens 

- 4 0 0 > 2 2 

Met Glu Asp Leu Pho Ser Pro 3er lie Leu Pro Pro Ala Pro Asn lit 



10 



1 5 



Dei Val Pro lie Leu Leu Gly Trp Gly Leu Asn Leu Thr Leu Gly Gin 

20 2 5 3 0 

31 y Aid Pro Ala Ser Gly Pro Pro Ser Arq Arq Val Arg Leu Val Phe 



3 5 



4 5 



Page 
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Leu Gly Val Tie Leu Va 1 Val Ala Va. Ala Gly Asn Thr Thr Vn 1 Leu 

->(, 55 60 

Cys Arg Leu Cys Gly Gly Gly Gly Pro Trp Ala Gly Pro Lys Arij Arg 
65 70 75 80 

Lys Met Asp Phe Leu Leu Val Gin Leu Ala Leu Ala Asp Leu Tyr: Ala 



Cys Gly Gly Thr Ala Leu Ger Gin Leu Ala Trp Glu Leu Leu Gly Glu 
100 105 :10 

Pro Arg Ala Ala Thr Gly Asp Leu Ala Cys Arg Phe Leu Gin Leu Leu 
115 120 125 

Gin Ala Ser Gly Arg Gly Ala Ser Ala His Leu Val Val Leu lie Ala 
130 " 135 140 

Leu Glu Arg Arg Arg Ala Val Arg Leu Pro His Gly Arg Pro Leu Pro 
145 150 155 160 

Ala Arg Ala Leu Ala Ala Leu Gly Trp Leu Leu Ala Leu Leu Leu Ala 
165 170 175 

Leu Pro Pro Ala Phe Val Val Arg Gly Asp Ser Pro Ser Pro Leu Pro 
180 185 190 

Pro Pro Pro Pro Pro Thr Ser Leu Gin Pro Gly Ala Pro Pro Ala Ala 
195 200 205 

Arg Ala Trp Pro Gly Glu Arg Arg Cys His Gly He Phe Ala Pro Leu 
210 215 220 

Pro Arg Trp His Leu Gin Val Tyr Ala Phe Tyr Glu Ala Val Ala Gly 

225 230 235 240 

Phe Val Ala Pro Val Thr Val l,eu Gly Val Ala Cys Gly His Leu Leu 
245 250 255 

Ser Val Trp Trp Arg His Arg Pro Gin Ala Pro Ala Ala Ala Ala Pro 

260 265 270 

Trp Ser Ala Ser Pro Gly Arg Ala Pro Ala Pro Ser Ala Leu Pro Arg 

275 280 285 

Ala Lys Val Gin Ser Leu Lys Met Ser Leu Leu Leu Ala Leu Leu Phe 
290 295 300 

Val Gly Cys Glu Leu Pro Tyr Phe Ala Ala Arg Leu Ala Ala Ala Trp 
305 310 315 320 

Ser Ser Gly Pro Ala Gly Asp Trp Glu Gly Glu Gly Leu Ser Ala Ala 

325 330 335 

Leu Arg Val Val Ala Met Ala Asn Ser Ala Leu Asn Pro Phe Val Tyr 
340 345 350 

Leu Phe Phe Gin Ala Gly Asp Cys Arg Leu Arg Arg Gin Leu Arg Lys 
355 360 365 

Arg Leu Giy Ser Leu Cys Cys Ala Pro Gin Gly Gly Ala Glu Asp Glu 
370 375 380 

Glu Gly Pro Arg Gly His Gin Ala Leu Tyr Arg Gin Arg Trp Pro His 
385 390 395 400 
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?::■ Hi- Tyi P . r P r < ; Ala Arc; Am Glu Pi.-. L*..-\i Asp Plu 2jv Gly Le'.j 
4 OP " 4 10 >;p 

Ara Hro Pre Pre Pro Arq Pre Arq Pro Leu 7io Cyi- P • • : Py < Gli: Per 
'3 2 ( i 4 2 5 4 3 u 

Ala Phe 

• ._ 1 j • 23 
*.211> 1041 
<:212> DNA 

< : 2 1 3 > H orr.o s apiens 

«:-100> 23 

atgt.ica?.:g qqr.catqctq ccqcatcqaq ggggacacca tctcccaqqt gatgccgccq 6 0 

c r q c r c a 1 1 q t g g c: c 1 1. 1 g t g c t g g g c g c a ctaggcaarg gggtcqccct. q t q t g g t t t c 12 0 

tgcttccaca tgaagacctg gaagcccag: actgtttacc tttt.caat.tt qqccqtqqct 1 HO 

gatttcctc: tiatgatctq cctgcctttt eggacagae: at:ac:t:-ag a:gtagaca: 2-3 0 

tgggcttttg gggacattcc ctgccgagtg gggctcttca cgttgg;cat gaa:agggcc 300 

gggageatcg tgttccttac ggtggtggct gcggacaggt atttcaaagt ggtccacccc 360 

:accacgcgg t^aacactat ctccacccgg gtggcqgctg geategtctg caccctgtgq 4 20 

gccc*;ggtca t:ctgggaa: agtgtatc:: ttgctgqaga accat:t:tg cgtgcaagag 480 

acggccgtct cctgtgagag ctt:atcatg gagteggeca atggctggca tgacatcatg 54 0 

ttccagctgq agttctttat gcccctcggc atcatcttat tttget. crtt ;aagattgtt 60 0 

tggagcctga ggegqaggea gcagctggcc agacaggctc ggatgaagaa ggcgacccgg 660 

ttcarcatgg tggtggcaat tgtgttcatc acatgctacc tg::cagcgt gtctgetaga 720 

ctctatttc: tctgqacggt gccctcgag: gectgogate cct:tgtcca r. qqggccctg 7 80 

cacataacc; tcagcttcac ctacatqaac ageatgetgg atcccctggt gtattatttt 8 4 0 

tcaacjcccc: cctttcccaa attctacaac aagctcaaaa tetgeaqtct. qaaacccaaq P00 

caqcoagga: ac r . caaaaac acaaaggecg gaagagatgc caatttcgaa cot. eggtcqc 9 60 

aggaqttgea tcagtgtggc aaatagttto caaaqcc agt ctgatgggca atgggatccc 10 22 
cacattgttq ag^ggcactg a 104 1 

• ; 1 • 2 4 
-211:- 34 6 

PP.T 



<213 



Homo sapiens 



<4 0C> 2 4 

Met Tyr Asn PPL y Ser Cys Cys Arg lie Glu Giy Asp Thi He Ser Gin 
1 5 2 0 lb 

Vai Met Pro Pro Leu Leu He Val Ala Phe Vai Leu Gly Ala Leu Gly 

2 0 2 P 3 0 

Page 2 4 



WO 0 1/36-P1 



IK T/l SUO/3 1 5o*> 



Asn Gly Val Ai c , Leu ..: 7 \. Giy Phe Cys Phe His Met. Lys Thr Trp Lvs 
3 5 4 0 A : - 

Pro Ser Thr Val Tyr ley. Phe Asri Leu A J a Va.; Ala Asp Phe Leu Leu 
50 55 60 

Met lie Cys Leu Pro Phe Arq Thr Asp Tyr Tyr Leu Arc Arq Arq His 

65 70 75 80 

Trp Ala Phe Gly Asp lie Pro Cys Arg Val Gly Leu Phe Thr Leu Ala 
85 90 95 

Met Asn Arg Ala Gly Ser lie Val Phe Leu Thr Val Val Ala Ala Asp 
100 105 110 

Arg Tyr Phe Lys Val Val His Pro His His Ala Val Asn Thr He Ser 
115 120 125 

Thr Arg Val Ala Ala Gly He Val Cys Thr Leu Trp Ala Leu Val He 
13 0 135 140 

Leu Gly Thr Val Tyr Leu Leu Leu Glu Asn His Leu Cys Val Gin Glu 
145 150 155 160 

Thr Ala Val Ser Cys Glu Ser Phe He Met Glu Ser Ala Asn Gly Trp 
165 170 175 

His Asp lie Met Phe Gin Leu Glu Phe Phe Met Pro Leu Gly He He 
180 185 190 

Leu Phe Cys Ser Phe Lys He Val Trp Ser Leu Arg Arg Arg Gin Gin 
195 200 205 

Leu Ala Arg Gin Ala Arg Met Lys Lys Ala Thr Arg Phe lie Met Val 
210 215 2 2 0 

Val Ala He Val Phe lie Thr Cys Tyr Leu Pro Ser Val Ser Ala Arg 
225 230 235 240 

Leu Tyr Phe Leu Trp Thr Val Pro Ser Ser Ala Cys Asp Pro Ser Val 

245 250 255 

His Giy Ala Leu His He Thr Leu Ser Phe Thr Tyr Met Asn Ser Met 

260 265 270 

Leu Asp Pro Leu Val Tyr Tyr Phe Ser Ser Pro Ser Phe Pro Lys Phe 

275 280 285 

Tyr Asn Lys Leu Lys He Cys Ser Leu Lys Pro Lys Gin Pro Gly His 
290 295 300 

Ler Lys Thr Gin Arg Pro Glu Glu Met Pro lie Ser Asn Leu Gly Arg 
:-05 310 315 320 

Arg Ser Cys lie Ser Val Ala Asn Ser Phe Gin Ser Gin Ser Asp Gly 
325 330 335 

Gin Trp Asp Pro His lie Val Glu Trp His 



340 



345 




25 

1011 

DNA 

Homo 



sapiens 
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a tci-.iac.: :i c a a t Ll c ,i - c a t q lattcajcc^ tcianiatct cttccc.it qrjc ttt.accaatc » u 

at tt acatcc tcc:ttg:at tgttggtgtt 1 1 1 g cr a a a c a c:.c;.ctct c a dtqaatat t t 1 2 0 

Ltaacaaaaa ta^qUiacida aacdtcaacq cacat. c tacc rqtcaca;;ct t g t g a c t c c a 1 8 0 

jacttacttc t. q t g c a g t q c catccctrtc atgaqt^tcr atttccrqa^: d:;gtt:c:aa 2 -1 0 

tagqaatatc aatcrqctca atgcagaqtq cicaattttc t caqaacrct atccato:at 30C 

gcaagtatgt ttgtoagtct cttaatttta agttggattg ccataaqccg ctatgctacc 360 

ttaatgcaaa aggattcctc gcaagagact acttcatgct atgagaaaat attttatgqc 4 2 j 

cat. t tact ga a^ j aa r. 1 1 eg ccaqcccAcii: tttg::agaa aactatgcat ttacacatqg 4 '-3 0 

ggagt.tqtac : ^ggcataat cattccaqtt accgtatact aetcagtcat agaggctaca 

gaaggaqaag aqugcctatg ctacaatcgg cagatggaac tagqagccat qarctctcag 600 

. a 1 1 . g c a g g t c t ; a 1 1 g g a a c cacatttatt ggattttcct 1 1 1 1 a g t a g r. actaacatca 6 6 0 

cactactctt ttgtaagcca tctgagaaaa ataagaacct gtacgtccat tatggagaaa 7 20 

gatttgactt acagttctgt gaaaagacat cttttggtca tccagattct actaatagtt 7 H 0 

tgcttccttc ctt.atagtat ttttaaaccc attttttatq ttcta caeca aaqagataac 8-5 0 

-gt caeca at tgaattattt aatagaaaca aaaaacattc tcacctgtct tgc tt egg cc 900 

agaagtagca cagaccccat tatatttctt ttattagata aaacattcaa gaagacacta 9 60 

tataatctct ttacaaagtc taattcagca catatgeaat catatggt tg a 1011 

! 1 0 > 2 6 
! 1 : > 3 3 6 
?12> PRT 

!15> Homo cap: ens 
<400> 26 

Met Asn Asn Asn Thr Thr Cys lie Gin Pro Ser Met He Ser 3er Met 

1 5 10 lb 

Ala Leu Pro He He Tyr He Leu Leu Cys He Val Gly Va .1 ?he Gly 

2 0 2 5 3 0 

Asn Thr Leu ,?er Gin Trp He Phe Leu Thr Lys He Gly Lys Lys Thr 

3 5 4 0 A 5 

Ser Thr Hie He Tyr Leu Ser His Leu Val Thr Ala Asn Leu Leu Val 

5 C 5 5 6 0 

Cys Ser Ala Met Pro Phe Met Ser He Tyr Phe Leu Lys Gly Phe Gin 

65 70 75 80 

Trp Glu Tyr Gin Ser Ala Gin Cys Arg Val Val Asn Phe Leu Gly Thr 

85 90 95 

Leu Ser Met His Ala Ser Met Phe Val Ser Leu Leu lie Leu Ser Trp 

100 105 HO 

Page 2 6 
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11^ Ala "lie Ser Arg Tyr Ala Th: Leu Met Gin Lys Asp Ser Sor Gin 
; i' 12C 125 

Glu Thr Thi Ser Cys Tyr Glu Lys He Phe Tyr Gly His Leu Leu Lys 
13 0 13 5 14 0 

Lys Phe Arg Gin Pro Asn Phe Ala Arq Lys Leu Cys lie Tyr He Trp 
145 150 " 155 ' 160 

Gly Val Val Leu Gly He He lie Pro Val Thr Vai Tyr Tyr Ser Val 
165 170 175 

He Glu Ala Thr Glu Gly Glu Glu Ser Leu Cys Tyr Asn Arg Gin Met 
180 185 " 190 

Glu Leu Gly Ala Met He Ser Gin He Ala Glv Leu lie Gly Thr Thr 

195 200 " 205 

Phe lie Gly Phe Ser Phe Leu Val Val Leu Thr Ser Tyr Tyr Ser Phe 

210 215 220 

Val Ser His Leu Arg Lys He Arg Thr Cys Thr Ser He Met Glu Lys 

225 230 235 240 

Asp Leu Thr Tyr Ser Ser Val Lys Arg His Leu Leu Val He Gin lie 

245 250 255 

Leu Leu lie Val Cys Phe Leu Pro Tyr Ser He Phe Lys Pro He Phe 

260 265 270 

Tyr Val Leu His Gin Arg Asp Asn Cys Gin Gin Leu Asn Tyr Leu He 
275 280 285 

Glu Thr Lys Asn lie Leu Thr Cys Leu Ala Ser Ala Arg Ser Ser Thr 
290 295 300 

Asp Pro lie lie Phe Leu Leu Leu Asp Lys Thr Phe Lys Lys Thr Leu 
305 310 315 320 

Tvr Asn Leu Phe Thr Lys Ser Asn Ser Ala His Met Gin Ser Tyr Gly 
325 330 335 

<210> 27 
<-'211> 1014 

<.;:12>* dna 

^.213> Homo sapiens 

<400> 27 

atgaatgagc cactagacta tttagcaaat gcttctgatt tccccgatta tgcagctgct 60 

tttggaaatt gcactgatga aaacatccca ctcaagatgc actacctccc tgttatttat 120 

ggcattatct tcctcgtggg atttccaggc aatgcagtag tgatatccac ttacattttc 180 

aaaatgagac cttggaagag cagcaccatc attatgctga acctggcctg cacagatctg 240 

ctgtatctga ccagcctccc cttcctgatt cactactatg ccagtggcga aaactggatc 300 

tttggagatt tcatgtgtaa gtttatccgc ttcagcttcc attteaacct gtatagcagc 360 

atcctcttcc tcacctgttt cagcatcttc cgctactgtg tgatcattca cccaatgagc 420 

tgcttttcca ttcacaaaac tcgatgtgca gttgtagcct gtgctgtggt gtggatcatt 480 

tcactggtag ctgtcattcc gatqaccttc ttgatcacat caaccaacag gaccaacaga 540 

Paae 27 



WO 01 3<>-Tl 



P( I I sou 3150*) 



r ;.i_:cctqtc t cgacct rac caqttcqy.it aa^:t;:^!.n c'..<ut taagr.q qtac,:.ir:t q t • 

a ; f ; t g a c t. q c a a c *: a c t 1 t c t g c s r c c c c t 1 a a t q a t a j tgacactct q c tdta: c »-i v.- q o 6 0 

0 t t a t c c a c a ctctqacccj t g g a c t a c a a c t q a c q r g c c tiaa g c a qaaaqc <j c g a "a 0 
r.qqctaacca t Let get cie t ecttqeattt tacatatgtt :tttaoc:tt ccatatcttg 7 8 2 
a q g g t c a 1 1 e a g a t c g a a t c t c g c c t g c 1 1 t c a a t c a g 1 . t g 1 1 c c: a 1 1 g a ga^tcag a t c 3 4 ( J 
catyaaqctt a eat. agtttc tagaccatta qotqctctqa acacc:tttgg taacctgt ta ^Ut 
c t a t a t g t g ^ t g g t c a g c g a caactttcaq caggctgtct g c tcaacagt gagatqeaaa 9 6 0 
gtaagcgqgs acettgagca agcaaagaaa attagttact caaacaaccc ttga 1014 

v 2 1 :.. • 2 8 

<211-> 3 3 7 

• 210 • PRT 

< 2 i 3 > H o:r. o ^,.ip ; e n s 

<■ 4 0 0 > 2 8 

Met Asn Glu Pro Leu Asp Tyr Leu Ala Asn Ala Ser Asp Phe Pro Asp 

1 5 10 15 

Tyr Ala Ala Ala Phe Gly Asn Cys Thr Asp Glu Asn lie Pro Leu Lys 
20 25 30 

Met His Tyr Leu Pro Val lie Tyr Gly He He Phe Leu Val Gly Phe 
35 40 45 

Pro Gly Asn Ala Val Val lie Ser Thr Tyr He Phe Lys Met Arg Pro 

5C 5 5 60 

Trp Lys Ser Ser Thr lie He Met Leu Asn Leu Ala Cys Thr Asp Leu 

6 5 7 0 "5 8 0 

Leu Tyr Leu Thr Ser Leu Pro Phe Leu lie His Tyr Tyr Ala Ser Gly 

85 90 95 

Glu Asn Trp He Phe Gly Asp Phe Met Cys Lys Phe He Arg Phe Ser 
100 105 110 

Phe His Phe Asn Leu Tyr Ser Ser He Leu Phe Leu Thr Cys Phe Ser 
115 12 0 12 5 

lie Phe Arg Tyr Cys Val He He His Pro Met Ser Cys Phe Ser He 
130 13 5 140 

His Lys Thr Arg Cys Ala Val Val Ala Cys Ala Val Val Trp lie lie 

143 150 155 160 

Ser Leu Va: A. a Val He Pro Met Thr Phe Leu He Thr Ger Thr Asn 
165 17 0 17 5 

Arg Thr Asn Arg Ser Ala Cys Leu Asp Leu Thr Ser Ser Asp Glu Leu 
180 185 190 

Asn Thr He Lys Trp Tyr Asn Leu He Leu Thr Ala Thr Thr Phe Cys 

195 ' 200 205 

Leu Pre Leu Val He Val Thr Leu Cys Tyr Thr Thr He He His Thr 

Paqe 2b 
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2 10 210 2 2 0 

Leu Thr His Gly Leu Gin Thr 7*sp Ser Cys Leu Lys Gin Lys Ala Arg 
220 230 " 230 240 

Arg Leu Thr lie Leu Leu Leu Leu Ala Phe Tyr Val Cys Phe Leu Fro 
245 250 255 

Phe His He Leu Arg Val lie Arg He Glu Ser Arq Leu Leu Ser lie 
260 265 " 270 

Ser Cys Ser He Glu Asn Gin He His Glu Ala Tyr He Val Ser Arg 
275 280 285 

Pro Leu Ala Ala Leu Asn Ihr Phe Gly Asn Leu Leu Leu Tyr Val Val 

290 295 300 

Val Ser Asp Asn Phe Gin Gin Ala Val Cys Ser Thr Val Arg Cys Lys 

305 310 315 320 

Val Ser Gly Asn Leu Glu Gin Ala Lys Lys He Ser Tyr Ser Asn Asm 
325 330 335 

Pro 



<:21Q> 29 

<211> 993 

<C12> DMA 

■213> Homo sapiens 

•:400> 2 e < 



atggatccaa 


ccaccccggc 


c tggggaaca 


gaaagtacaa 


cagtgaatgg 


aaatgaccaa 


qcccttcttc 


tgctt t gtgg 


caaggagacc 


ctgatcccgg 


tcttcctgat 


ccttttcatt 


accctggtcg 


ggctggtagg 


aaacgggtt t 


gtgctctggc 


tcctgggct t 


ccgcatgcgc 


aggaacgcct 


tc tctgtcta 


cgt cctcagc 


ctggccgggg 


ccgacttcct 


cttcct ct gc 


ttccagatta 


taaattgcct 


ggtgtacctc 


agtaacttct 


tctgttccat 


ctccatcaat 


ttccctagct 


t ct t caccac 


tgtgatgacc 


tgtgcctacc 


ttgcaggcc t 


gagcatgctg 


aqcaccgtca 


gcaccgagcg 


ctgcctgtcc 


gtcctgtggc 


ccatctggta 


t cgctgccgc 


cgccccagac 


acctgtcagc 


ggtcgtgtgt 


gt cctgctct 


gggccctgtc 


cctactgctg 


agcatcttgg 


aagggaagt t 


ctgtggct tc 


ttatttagtg 


atggtgactc 


tggt tggt gt 


cagacatttg 


at ttcatcac 


tgcagcgtgg 


ctgatttttt 


tattcatggt 


tctctgt ggg 


t ccagtctgg 


ccctgct ggt 


caggat cctc 


tgtggctcca 


ggggtctgcc 


ac tgaccagg 


ctgtacctga 


ccatcctgct 


cacagtgctg 


gtgttcctcc 


tctgcggcct 


gccctttggc 


a ttcagtggt 


tcctaa tat t 


atggatct gg 


aaggatt ctg 


atgtcttatt 


ttgtcatatt 


catccagt tt 


cagttgt cct 


gt catctctt 


aacagcagtg 


cca.acccca.t 


catttacttc 


tt cgtgggct 


ct tttaggaa 


gcagtggcgg 


ctgcagcagc 


cgat cctcaa 


gctggctct c 


cagagggctc 


tgcaggacat 


tgctgaggtg 


gatcacagtg 


aaggatgctt 


ccgt cagggc 


accccggaga 


tgt cgagaag 


cagtctggtg 


tag 







6 0 
120 
180 

2-; o 

3 00 

3>;o 

42 0 
A SO 
54 0 
600 
6 00 
i . 0 
780 
& 4 0 
900 
960 
9 93 
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-.210 • 3 0 

.2 i ". 330 

<2l:>> FPT 

2 1 3 > H one sapiens 

■ 4 0 0 > 3 0 

Met Asp Pro Thr Thr Pro Ala Trp Gly Thr Glu Ser Thr Thr Val Asn 
1 5 10 15 

Gly Asn Asp Gin Ala Leu Leu Leu Leu Cys Gly Lys Glu Thr Leu lie 

20 25 30 

Pro Val Phe Leu He Leu Phe lie Ala Leu Val Gly Leu Val Gly Asn 

3 5 4 0 4 5 

Gly Phe Val Leu Trp Leu Leu Gly Phe Arq Met Arg Arg Asn Ala Phe 

r >L' 5 5 60 

Cer V<: I iy: Val Leu Ser Leu Ala Gly Ala Asp Phe Leu Phe Leu Cys 
• 7 0 7 5 8 0 

Pne Gin lie lie Asn Cys Leu Val Tyr Leu Ser Asn Phe Phe Cys Ser 
85 90 95 

lie G^r lie Asn Phe Pro Ser Phe Phe Thr Thr Val Met Thr Cys Ala 

100 105 110 

Tv: Lo.i A 1 ci Gly Leu Ser Met Leu Ser Thr Val Ser Thr Glu Arg Cys 
115 120 125 

Leu Gr-r val Leu Trp Pre lie Trp Tyr Arg Cys Arg Arg Pro Arg His 
13m 135 140 

Leu Set Ala Val Val Cys Val Leu Leu Trp Ala Leu Ser Leu Leu Leu 
145 150 155 160 

Ser lie L-u Glu Gly Lys Phe Cys Gly Phe Leu Phe Ser Asp Gly Asp 
165 170 ' 175 

Ser Gly Trp Cys Gin Thr Phe Asp Phe He Thr Ala Ala Trp Leu He 
1 b 0 1 8 5 19 0 

Phe Leu Phe Met Val Leu Cys Gly Sei Ser Leu Ala Leu Leu Val Arg 
195 200 205 

He Lei] "vs Gly Ser Arg Gly Leu Pro Leu Thr Arg Leu Tyr Leu Thr 

210 ' 215 220 

He Leu U-<; Thr Val Leu Val Phe Leu Leu Cys Gly Leu Pro Phe Gly 

22b 23C 235 240 

He Gin. Tip Phe Leu He Leu Trp He Trp Lys Asp Ser Asp Val Leu 
245 250 255 

Phe Cys His lie His Pre Val Ser Val Val Leu Ser Ser Leu Asn Ser 
260 265 270 

Ser Ala Asn Pro He He Tyr Phe Phe Val Gly Ser Phe Arg Lys Gin 

275 280 2B5 

Trp Arg Leu Gin Gin Pro Tie Leu Lys Leu Ala Leu Gin Arg Ala Leu 

290 295 300 
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P( I/l S0O/3I509 



G In Asp lie Ala G 1 u V a 1 A s p H i. .y Ser G 1 u G 1 y C y s p h c A x q G 1 r. G 1 v 
305 310 * 315 " " 320 

Thr Pre Glu Met Ser Arc Ser Ser Leu Val 

325 " 330 

<210> 3 1 

:211;- 1092 

■;212:- DNA 

•'■213-- Homo sapiens 

•:400.^ 31 



cggcccccg 


gegaggeget 


gc tggcggg t 


ctcctggtga 


tggt act ggc 


cgtggcgctg 


60 


OI3tCCd3Cg 


LdLiygiy ^ l. 


gctttgttgc 


gcctacagcg 


ctgagctccg 


cactcgagcc 


1 2 0 


t :agccgtcc 


tcctggtgaa 


tctgtcgctg 


ggccacct g c 


tget ggegge 


get gga ca t g 


iti 


coct t caege 


tgctcggtgt 


ga tgcgcggg 


egga ca ccg t 


cggcgcccgg 


cgcatgccaa 


2 4 0 


g t c a t t g g c t 


tcctggacac 


cttcctggcg 


tccaacgcgg 


cgctgagcgt 


ggcggcg -tg 


3 0 0 


rigcgcaga.ee 


agtggctggc 


agtgggcttc 


ccactgcgct 


acgccggacg 


cctgcgaccg 


3 60 


cgctatgccg 


gcctgctgct 


gggctgtgcc 


t ggggacagt 


cgctggcct t 


ctcaggcgct 


4 2 0 


gcacttggct 


gctcgtggct 


tggctacagc 


agcgccttcg 


cgtcctgttc 


getgegc^tg 


4 HO 


c cgcccgagc 


ctgagcgtcc 


gcgcttcgca 


gect tcaccg 


ccacgc t cca 


tgccgtg ggc 


54 0 


i.-:cgtgctgc 


cgctggcggt 


gctctgcctc 


acctcgctcc 


aggtgcaccg 


ggtggcacgc 


60 0 


agccactgcc 


agcgcatgga 


caccgt cacc 


a tgaaggege 


tcgcgctgct 


cgccgacctg 


6 


caccccagtg 


tgcggcagcg 


ctgcctcatc 


cagcagaagc 


ggcgccgcca 


ccgcqccdcc 


7 2 1"; 


a qgaagat tg 


gca ttgetat 


tgcgaccttc 


ctcatctgct 


tt gccccgt a 


tgtcatgacc 


7 8 0 


aqget ggegg 


agct cgtgcc 


cttcgtcacc 


gtgaacgccc 


agtggggcat 


cctcagcaag 


84 0 


t.gcctgacct 


acagcaaggc 


ggtggccgac 


ccgttcacgt 


actctctgct 


ccgccggccg 


90 0 


t tccgccaag 


tcctggccgg 


catggtgcac 


eggctgetga 


agagaacccc 


gcgcccagca 


9 60 


t.ccacccatg 


acagct ctct 


ggatgtggcc 


ggcatggtgc 


accagctgcc 


gaagagaacc 


102 0 


ccgcccccag 


cgtccaccca 


caacggctct 


gtggacacag 


agaatgattc 


ctgcctgcag 


1080 


cagacacact 


ga 










10*2 



vlil0> 32 

■:;:n> 363 
• 2 12 > PRT 
*'213'-- Homo sapiens 

-.4 00 • 32 

Met Gly Pro Gly Glu Ala Leu Leu Ala Gly Leu Leu Val Met Val Leu 
15 10 15 

Ala Val Ala Leu Leu Ser Asn Ala Leu Val Leu Leu Cys Cys Ala Tvr 

20 25 30 
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LOU H I O Jtil 



Thi Ar j A 1 u 5"i Cilv Vul Leu U-u Val Ar-.n Lou 



Lie: Lou ; v His Leu L-u Leu A 1 a Ala Lou Asp Met Pro I 'ho Tn: Lou 



Lou G 1 y V a I M o t A ro G 1 v A r c T t : r P r < > S e r A la P r o G I v A 1 a C v s G .: n 

f.3 " ~s' - m,> 

Val lie Gly ?he Leu Asp; Thr Lhe Lou Ala Ser Asn Ala Ala Lou Ser 
.- : . 9 0 9 5 

Val Ala Ala Leu Ser Ala Aso Gin Trp Leu Ala Val Gly Phe Pro Leu 
100 ' 105 110 

Ara Tyr Ala Gly Arc Leu Arq Pro Arc Tyr Ala Gly Leu Leu Leu Gly 

115 ' " ' 120 125 

Cys Ala Tr p G 1 y Gin Ser Leu Ala P h e Ser G J y Ala Ala Lou G 1 y G y s 
130 135 140 

Ser Trp Leu Gly Tyr Soi Ser Ala Phe Ala Ser Cys Ser Leu Arg Lou 
145 150 155 1 60 

Pro Pro Glu Pro Glu Arg Pro Arg Phe Ala Ala Phe Thr Ala Thr Leu 
165 170 17 5 

His Ala Val Gly Phe Val Leu Pro Leu Ala Val Leu Cys Leu Thr Ser 

ISC 185 190 

Leu Gin Val His Arq Val Ala Arg Ser His Cys Gin Arg Met Asp Thr 
195 " 200 205 

Val Thr Met Lys Ala Leu Ala Leu Leu Ala Asp Leu His Pro Ser Val 
210 215 220 

Arg Gin Arc Cys Leu lie Gin Gin Lys Arg Arg Arg His Arg Ala Thr 

225 ' 2 30 2 35 24 0 

Arg Lys Tie Gly He Ala lie Ala Thr Phe Leu He Cys Phe Ala Pro 
24 5 2 50 2 55 

Tyr Val Met Thr Arq Lou Ala Glu Leu Val Pro Phe '2a 1 Thr Val Asn 

2 60 " 2 6b 27 0 

Ala Gin Tro Gly Tie Lou Ser Lys Cys Leu Thr Tyr Ser Lys Ala Val 

275 280 285 

Ala Asp Pro Phe Thr Tyr Ser Leu Leu Arg Arg Pro Phe Arq G 1 r. Val 
2 90 295 300 

Leu Ala Gly Met Val His Arg Leu Leu Lys Arq Thr Pro Arg Pro A La 
30 5 310 315 32 0 

Ser Thr His Aso Ser Ser Leu Asp Val Ala Gly Mot Val His Gin Leu 
325 330 335 

Leu Lys Arq Thr Pro Arg Pro Ala Ser Thr His Asn Sly Ser Val Asp 
340 ~ 345 350 

Thr Glu Asr. Asp Ser Cys Leu Gin Gin Thr His 

35 5 360 

<:210> 33 
<211> 112 5 
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PC l /l SOO/3 15U*> 



:2 12> DNA 

: 2 1 3 > H omo sap i e r \ 



■r-JOO> 3 3 
atgcccacac 


tcddtacttc 


tgcct c t cca 


cccacattct 


tctgggccaa 


tgcctcc:gga 


g q c a g t g t g c 




cqatgctccg 


atgcctgtca 


aattcctagc 


cc tgaggctc 


atggttgccc 


tggcc tatgg 


gcttgtgggg 


gccaitggct 


tgc tgggaaa 


tttggcggtc 


ctgtqggtac 


tgagraactg 


tgcccggaga 


gcccctggcc 


caccttcaga 


caccttcgtc 


titcac.cctqg 


ctctggcgga 


c ^tgggact g 


gcact cactc 


t cccc ttt tg 


ggcagecgag 


tcggcactgg 


actttcactg 


gcccttcgga 


ggt gccctct 


gcaagatggt 


tctgacggcc 


ctctgtcct ca 


acgtctatgc 


cagcatcttc 


ctcatcacag 


cgct gagcgt 


tgctcgctac 


tgqgt ggccg 


ccatggctgc 


ggggccaggc 


acccacctct 


cactct tctg 


ggcccgaa ta 


qccaccc tag 


c a g t g t g g g c 


ggcggctgcc 


ctggtgacgg 


tgcccaca gc 


tgtcttcggg 


g t qgaggq t g 


aggtgtgtgg 


tg tgcgcct t 


tgcct gctgc 


gt 1 1 ccccag 


caggtac tgg 


c tggg ggcct 


accagctgca 


gagggtggtg 


ctggctttca 


t gg t gccc 1 1 


gggcgt ca t c 


accac cagct 


acctgctgct 


get ggcc 1 1 c 


ctgcagcggc 


gg caa c gg eg 


geggcaggae 


■ : :qcagggt.c q 


t-jgcccgctc 


tgtccgcatc 


ctggtggc tt 


ccttct tcct 


ctgctggt tt 


ccca>jcci i g 


tqgtcactct 


ctggggtgtc 


ctggtgaagt 


ttgacctggt 


gccctggaac 


agtac^t '_C t. 


a':acrarcca 


gacgtatgtc 


ttccctgtca 


ctact tgett 


ggcacacagc 


daiaar t q<: c 


tcaaccctgt 


gctgtactgt 


ctcctgaggc 


gggagccccg 


gcaggctctg 


uoaggcci t 


t raqggatct 


gcggtcgagg 


ctgtggcccc 


agggeggagg 


ctgggtgcaa 


caggc qq cc c 


c iaagcaggt 


aggcaggcgg 


tgggtcgcaa 


gcaacccccg 


ggagagcege 


ccttc:accc 




cct ggacaga 


gggacacccg 


ggtga 





-_1C 34 

■ 211 374 

•112 • PRT 

:213> Homo sapiens 



4 0 3 • 3 4 

Met Pro Thr Leu Asn Thr Ser Ala Ser Pro Pro Thr Phe Phe Trp Ala 
] 5 10 15 

Asn Ala Ser Gly Gly Ser Val Leu Ser Ala Asp Asp Ala Pro Met Pro 

20 25 30 

Val Lys Phe Leu Ala Leu Arg Leu Met Val Ala Leu Ala Tyr Gly Leu 
3 5 4 0 4 5 

Val Gly Ala lie Gly Leu Leu Gly Asn Leu Ala Val Leu Trp Val Leu 

50 55 60 

Ser Asn Cys Ala Arq Arg Ala Pro Gly Pro Pro Ser Asp Thr Phe Val 

b5 " 70 75 80 
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60 
12 0 
180 
24 0 
3 0 0 

3 60 

4 2 0 
4 8 0 
540 
60 0 
660 
7 ? 0 

H4 0 
900 
9 60 
1020 
1080 
1125 



\\ () ni 3<»4^l 



Ph- I. - 1 . I.-i: Ala Asp Leu Gly Leu A 1 a :-.-u Chi Ire Phe 

.' ■ 1 . h r 1 ^ t; . 

Trp AU Ala Gru Se: Ala l.eu Asp Phe H - : ■ Trp Pro Phe Gi y uly Ala 

lOu 105 1.0 

Leu Cys Lys Met Val Lou Thr Ala Thr Val Leu Asn Val Tyr Aid Ser 

us i : o i:o 

lie Phe Leu 11^ Thr Ala Leu Ser Val Ala Arq Tyr Tr: Val Val Ala 

1 3 0 1 3 5 1 4 0 

Met Ala Ala Gly Pro Gly Thr His Leu Ser Leu Phe Trp Ala Arg He 
145 150 155 160 

Ala Thr Leu Ala Val Trp Ala Ala Ala Ala Leu Val Thr Val Pro Thr 

165 170 175 

Ala Val Phe Gly Va i Glu Gly Glu Val Cys Gly Val Arq Leu Cys Leu 
18 0 18 5 1 9 0 

Leu Arg Phe Pro 'ior Arg Tyr Trp Leu Gly Ala Tyr Gin Leu Oln Arq 

19b 200 205 

Val Val Leu Ala Phe Met Val Pro Leu Gly Val He Thr Thr Ser Tyr 
210 215 22 0 

Leu Leu Leu Leu Ala Phe Leu Gin Arg Arg Gin Arg Arg Arg Gin Asp 
225 230 235 ~ 240 

oer Arg Val Val Ala Arg Ser Val Aig lie Leu Val Ala Ser Phe Phe 

24 5 250 2 55 

Leu Cys Trp Phe Pro Asn His Val Val Thr Leu Trp Gly Val Leu Val 

260 265 270 

Lys Phe Asr Leu Val Pre Trp Asn Ser Thr Phe Tyr Thr He Gin Thr 

2 75 28 0 28 5 

Ty: Val Phe • ' r . Val Thr Thr Cys Leu Ala His Ser Asn Ser Cys Leu 
:90 2 95 3 00 

.Asn Pro Val Leu Tyr Cys Leu Leu Arg Arq Glu Pro Arq Gin Ala Leu 

305 310 315 32 0 

A_a Gly Thr Phe Aro Asp Leu Arg Ser Arq Leu Trp Pro Gin Gly Gly 

32b 330 335 

Gly Trp Val Gin Gin Val Ala Leu Lys Gin Val Gly Arq Arc Trp Val 
340 345 350 

Aia Ser Asn Pre Arc Glu Ser Arg Pro Ser Thr Leu Leu Thr Asn Leu 
?55 " 360 365 

A u p A r g G 1 y T h r Pro Gly 

2 7 0 

-210> 3 5 

• 311 10 92 

-212:- DNA 

<"213> Homo sapiens 

<AQ0> 3 5 

ataaategge accatct^ca ggatcacttt ctg jaaatag a c a a g a a g a a etgcLgtgtg 
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a -7 1 1 c a 1 1. g t 


c a a g g t g 1 1. g 


ccgccggtgt 


tggggcr. gga 


-4 tt. tatcttc 




gggct :c:gc 


q c a a t q gect 


tg :cct gtgg 


atrc:c:gr.t 


tccacctcaa 


gtcctggaaa 


180 


tocagc c g g a 


rtrtcctgtt 


caacctggca 


gtggctgact 


ttctac t g a t 


cat. :tgcctc 


240 




tggaiaacta 


tgt gaggcg t 


tgg gaetgga 


agt ttgggqa 


cat:ccttgc 


3 0 J 


c-ggctgat n:; 


:::t:atgtt 


ggcta tgaac 


cgccagggca 


gcatcatct t 


cct ;acggtg 


3 60 


gtggcggtag 


acaggtattt 


ccgggtggtc 


catccccacc 


acgccctgaa 


caagatctcc 


4 20 


aatcggacag 


cagccatca t 


ctcttgcctt 


ctgtggggca 


tcact at tgg 


cctgacagt c 


4 80 


r;acctcct 3a 


agaagaagat 


gccgatccag 


aatggcggtg 


caaatttgt g 


cagcagcttc 


54>) 


agca tctgoc 


ataccttcca 


gtggcacgaa 


gccatgttcc 


tcctggagtt 


cttcctgccc 


60 0 


r:tgggcatca 


t :ctgttctg 


ct cagecaga 


attatctgga 


gect geggea 


gagacaaa tg 


6 6 J 


qaccggcatg 


craagatcaa 


gagagecate 


acct tcatca 


tggtggtggc 


categtcttt 


720 


gtcatctgct 


tccttcccag 


cgtggttgtg 


cggatccgca 


tctt ctggc t 


cctgcacact 


780 


tcgggcacgc 


agaa ttgtga 


agtgtaccgc 


tcggtggacc 


r.ggcgttctt 


tatcactctc 


840 


agcttcacct 


aca tgaacag 


catgetggae 


cccgtggtgt 


actactt ct c 


cagcccatcc 


900 


tttcccaact 


tcttctccac 


tttgatcaac 


cget gcctcc 


agaggaaga t 


gacaggtgag 


960 


ccagataata 


accgcagcac 


gagegt cgag 


ctcacagggg 


accccaacaa 


aaccagaggc 


1020 


get ccagagg 


cgttaatggc 


caactccggt 


gagccatgga 


gcccctctta 


t ctgggccca 


1080 


acct ctcctt 


aa 










1092 



v210> 3 6 

<211> 3n3 

<212> PRT 

<213> Homo sapiens 

-.400> 36 

Met Asn Arg His His Leu Gin Asp His Phe Leu Glu lie Asp Lys Lys 
2 " 5 10 15 

Asn Cys Cys Val Phe Arg Asp Asp Phe lie Val Lys Val Leu Pro Pro 
20 * 25 30 

Val Leu Giy Leu Glu Phe lie Phe Gly Leu Leu Gly Asn Gly Leu Ala 
35 40 45 

Leu Trp He Phe Cys Phe His Leu Lys Ser Trp Lys Ser Ser Arg lie 

50 55 60 

Phe Leu Phe Asn Leu Ala Val Ala Asp Phe Leu Leu lie lie Cys Leu 
65 70 75 80 

Pro Phe Leu Met Asp Asn Tyr Val Arg Arg Trp Asp Trp Lys Phe Gly 

8 5 90 95 

Asp lie Pro Cys Arg Leu Met Leu Phe Met Leu Ala Met Asn Arg Gin 

100 105 110 
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IH I I SOU/3150*) 



! y lie Lie E-he Lei: Thr Val Val AH Va 1 Asp^ A : q Tyr. Phe Arq 

1 i \. 12 o : ;• •" 

Val VH Pro Hi? His Ala Leu A;~ti Ly;: II- Ser As-: Arq Thr AH 

13 0 i 3 : ' 14 J 

Ah: lie lit? Set Cys Leu Leu Trp Gly lie Thr lie Gly Leu Thr V>. 1 

14; 150 1 5 L I t,,, 

His. Leu Leu Lys Lys Lys Met Pre; I:e Gin Asn Gly Gly Ala Asn Leu 

It: I 7 0 1 " 5 

Cys Ser Ser Phe Ser He Cys His Tnr Phe Glri Trp; His Glu Ala Met 

180 IBS 190 

Phe Leu Leu Glu Phe Phe Leu Pro Leu Gly He He Leu Phe Cys Ser 

1 9 0 2 0 0 2 0 5 

Ala Arg He He Trp Ser Leu Arq Gin Arq Gin Met Asp Arq His AH 

2 10 215 ~ ~ 220 

Lys lie Lys Arq Ala He Thr Pno He Met Val Val Ala He Val Phe 

2 2b " 2 30 2 3b 240 

Val lie Cys Phe Leu Pro Ser Val Val Val Arg He Arg He Phe Trp 

245 250 255 

Leu Leu His Thr Ser Gly Thr Gin Asn Cys Glu Val Tyr Arg Ser Val 

260 265 270 

Asp Leu Ala Phe Phe lie Thr Leu Ser Phe Thr Tyr Met Asn Ser Met 

2 7 5 2 8 0 2 8 5 

Leu Asp Pro Val Val Tyr Tyr Phe Ser Ser D ro Ser Phe Pro Asr. Phe 

2 9 0 2 9 5 3 0 0 

Phe Ser Thr Leu He Asn Arg Cys Leu Gin Arq Lys Met Thr Gly Glu 

Vy- 310 H5 320 

Pro AoD Asn Asn Arq Ser Thr Ser Val Glu Leu Thr Gly Asp Pro Asn 

325 330 * 335 

Lys Thr Arc Gly Ala Pro Glu Ala Leu Met Ala Asn Ser Gly Glu Pro 

"34 0 34 5 3 50 

Trp Ser Pro Ser Tyr Leu Gly Pre- Thr Ser Pro 

3 5 5 3 6 0 



3 11 
2 12 
2 13 



104 4 
DNA 

Home sapien.: 



■:A JO - 3 7 

atoqgqgatg aqctqgcacc ttgccctgtg gqcactacag cttqgccqqc cctgatcoaq 60 

::catcagca agacaccctg catgccc:aa qcagccagca acacttcctt gggcctaggg 12 0 

qac:tcagqg tqcocaqctc catgetgtae tggzttttcc ttcccteaag cctgctgqct 18 0 

acagccacac tqgctgtcag ccccctgctg ctggtgacca tectgeggaa ccaacggctg 24 0 

cqaragqaqc cecactacct gct.ee eg get. aacatcctgc tc:caqac:t gqectacatr. 30 0 

ctcetccaea t get cat etc ctccagcagc ctgggtggct gggagctggg ccgcatqgcc 360 
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tqt gac, :tc tcactqatgc tgtcticgcc gcctgca:c;i gcaccatccL gtccttcacc ^20 

qocatutqc tgcaca:cta cctggcagtc atccat:c:ac tgcqctacct c:ccttcatg 4^0 

'::ccatcggq ctgcctggaa ggcagtggcc c:catctggc tggtggoctg ctgcttcccc c 1 ' 

a;attcc:ta tttggctcag -aagtggcag gatgcccagc tggaggagca aqgagcitca 60 0 

f.acatcciac caccaagcat gggcacccag cc gggatgtg gcctcctggt cattgttacc 6 60 

tacacctcca ttctgtgcgt tctgttcctc tgcacag;tc tcattgccaa ctgtttctgg 720 

aggatctatg cagaggccaa gacttcaggc atctgggggc agggctatto ccgggccagg 780 

ggcaccctgc tgatccactc agtgctgatc acattgtacg tgagcacagg ggtggtgttc 610 

r.ccctggaca tggtgctgac caggtaccac cacattgact ctgggact ca cacatggctc 900 

■:; r .ggcagcta acagtgaggt actcatqarg cttccccgtg ccatgctccc atacctgtac 9 60 

■:tgctccgct accggcagct gttgggcatg gtccggggcc acctcccatc caggaggcac 1020 

:dggccatct ttaccat:tc ctag 1044 

:210> 38 

•:211> 347 

-.J!12> PRT 

213> Homo sapiens 

•:400> 38 

Met Gly Asp Glu Leu Ala Pro Cys Pro Val Giy Thr Thr Ala Trp Pro 
15 10 15 

Ala Leu Tie Gin Leu lie Ser Lys Thr Pro Cys Met Pro Gin Ala Ala 
20 25 30 

.■'ei Asn Thr Ser Leu Gly Leu Gly Asp Leu Arg Val Pro Ser Ser Met 
35 40 45 

Leu Tyr Trp Leu Phe Leu Pro Ser Ser Leu Leu Ala Ala Ala Thr Leu 
50 55 60 

Ala Val Ser Pro Leu Leu Leu Val Thr lie Leu Arg Asn Gin Arg Leu 

o5 70 75 80 

Arg Gin Glu Pro His Tyr Leu Leu Pro Ala Asn lie Leu Leu Ser Asp 

85 90 95 

Leu Ala Tyr lie Leu Leu His Met Leu lie Ser Ser Ser Ser Leu Gly 

100 105 110 

Gly Trp Glu Leu Gly Arg Met Ala Cys Gly He Leu Thr Asp Ala Val 
115 120 125 

Phe Ala Ala Cys Thr Ser Thr He Leu Ser Phe Thr Ala He Val Leu 
130 135 140 

His Thr Tyr Leu Ala Val He His Pro Leu Arg Tyr Leu Ser Phe Met 
145 150 155 160 

Ser His Gly Ala Ala Trp Lys Ala Val Ala Leu He Trp Leu Val Ala 
165 170 175 
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P( I I soo 3 1 50«J 



r-ys "y;. Lhe Pig Thi Ph.- Leu Tie T r p Leu S'er Lvs Trp Gin Af-'p Ala 

u-0 18 5 l^i; 

Gin Leu Gle Giu Gin Gly Ala iier Tyr lie Leu Pro Pro Ser Met G 1 y 

; c e" " LOO 205 

Thr Glr: 5 i . • Gly Cys Gly Ln: Leu Val T K* Val Thr Tyr Thr Ser He 

010 01 0 220 

Leu Z y :-: Va 1 Leu Phe Le u C y s Thr Ala I .*? u Tie Ala As n C y s P h e Tip 
^2 5 ' 2 30 235 Mu 

Arg lie Tyr Ala Glu Ala Lys Thr Ser Gly Tie Trp Gly Gin Gly Tyr 

245 250 255 

:-;«r Arq A1 C ] Arg Gly Thr Leu Leu lie His Ser Val Leu lie Thr Leu 

2 6C 2 65 27 0 

Tyr Val Ser Thr Gly Val Val Phe Ser Leu Asp Met Val Leu Thr Ann 

27 5 2S0 28 5 

Tyr His His lie Asp Ser Gly Thr His Thr Trp Leu Leu Ala Ala Asn 

290 295 300 

Ser Glu Val Leu Met Met Leu Pro Arg Ala Met Leu Pro Tyr Leu Tyr 

305 310 " 315 320 

Leu Leu hrq Tyr Arg Gin Leu Leu Gly Met Val Arg Gly His Leu Pro 

32 5 3 30 335 

Ser Arg Arg His Gin Ala lie Phe Thr He Ser 

3 4 0 3 4 5 

<21C ■ 39 

<211 ■ 1023 

<212 > DNA 

-213 ■ Homo sapiens 

4 0 2: • 39 

atga.-itccat ttcatgeate ttgttggaac acctctgccg aacttttaaa caaatc-Ttgg 60 

aa tauaqagt ttgettatea aactgecagt gtqqtaqata cagtcatcct cccttccavq 12 0 

uttgqgatta tctgttcaac aggqctggtt ggcaacatcc tcattgtatt cactataata 180 

aqat'::auqa aaaaaacagt. coctgacatc t at atetgea acctqgctgt ggctgatttq 240 

gt cc a :a tag ttggaatgcc ttttcttatt caccaarqgg cccgaggagg a gag tg gate 300 

tttgqggggc ctctctgcac catcatcaca tccctggata cttataacca atttgecrgt. 360 

agtgccatca tqactgtaat gagtgtggac aggtactttg ccctcgtcca accatf.cga 4 20 

ctgacacqtt ggagaacaag gtacaagacc ateegqatea atttqqgcct ttgqgcagct. 480 

tcctttat.ee tggcattgee tgtctggqtc tactcgaaga tcatcaaatt taaagaogqt 54 0 

gt tgagaq t L gtgcttttga tttgacatcc crtgacgatg tactctggta tacactttat 60 0 

ttgacgataa caactttttt 1 1. tccctcta crcttgattt tggtqtqcta tattttaatt. 660 

ttatgetata cttgggagat gtatcaacag aataaggatq ecagatgetg caatcccaqt 720 

ataccaaaac agagagtgat gaaqttgaca aagatgqtgc tggtgctggt ggtaqtcttt 780 

Faqe 3 8 
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a t c c t, g a g t g ctgcccctta rca:qr.qa;a caactggtca acttacagat qgaacagc c c 8 4 0 

acactggcct tctatgtggg ttattaccrc :ccatctqtc tcagctatqc cagcagcagc 9 C C 

attaaccctt ttctctacat cc:gctgag; ggaaatttcc agaaacgtct gcctcaaatc 960 

caaaqaagag cgactgagaa ggaaatcaac aatatgogaa acactctgaa atcacacttt 102 0 

tag 1023 

<210> 40 

<211> 340 

<212> PRT 

<2 1 3> Homo sapiens 

<<i00> 4 0 

Met. Asn Pro Phe His Ala Ser Cys Trp Asn Thr Ser Ala Glu Leu Leu 
1 5 10 15 

Asn Lys Ser Trp Asn Lys Glu Phe Ala Tyr Gin Thr Ala Ser Val Val 
20 25 30 

Asp Thr Val lie Leu Pro Ser Met lie Gly lie lie Cys Ser Thr Gly 
35 4 0 4 5 

Leu Val Gly Asn lie Leu lie Val Phe Thr lie lie Arg Ser Arg Lys 

50 55 60 

Lys Thr Val Pro Asp He Tyr lie Cys Asn Leu Ala Val Ala Asp Leu 
65 70 75 80 

Val His He Val Gly Met Pro Phe Leu He His Gin Trp Ala Arg Gly 

85 90 95 

Gly Glu Trp Val Phe Gly Gly Pro Leu Cys Thr He lie Thr Ser Leu 
100 105 110 

Asp Thr Cys Asn Gin Phe Ala Cys Ser Ala He Met Thr Val Met Ser 
115 120 125 

Val Asp Arg Tyr Phe Ala Leu Val Gin Pro Phe Arg Leu Thr Arg Trp 
130 135 140 

Arg Thr Arg Tyr Lys Thr lie Arg lie Asn Leu Gly Leu Trp Ala Ala 
145 150 155 160 

Ser Phe lie Leu Ala Leu Pro Val Trp Val Tyr Ser Lys Val He Lys 
165 170 175 

Phe Lys Asp Gly Val Glu Ser Cys Ala Phe Asp Leu Thr Ser Pro Asp 
180 185 190 

Asp Val Leu Trp Tyr Thr Leu Tyr Leu Thr He Thr Thr Phe Phe Phe 
195 200 205 

Pro Leu Pro Leu lie Leu Val Cys Tyr lie Leu He Leu Cys Tyr Thr 

210 215 220 

Trp Glu Met Tyr Gin Gin Asn Lys Asp Ala Arg Cys Cys Asn Pro Ser 
225 230 235 240 

Val Pro Lys Gin Arg Val Met Lys Leu Thr Lys Met Val Leu Val Leu 
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Val Veil Vdl Phe Tie Lou S-r Ala Ala Pro Tyr His Val Tic Gin L«-u 

2 o 0 2 o - 2' "/ 0 

Vai Asn Leu Gin Met Glu Gin E'l o Tr: r Lou Ala ?he Tyr Vu 1 Giy Tyr 

Tyr Leu Lor lie Cys Leu L^r Tyr Ala Ser Ser Ser lie Asn Pro Fne 
2 9 2 .c r 3 0 0 

Leu Tyr lie Leu Leu Ser Cly Asn Phe CM n Lys Arg Leu Pro Gin Tie 
305 310 ' 315 " 320 

Gin Arg Arq Ala Thr Glu Lys Glu lie Asn Asn Met Gly Asn Thr Leu 

325 330 335 

Lys Ser His Phe 
34 0 

•'■210 * 4 1 
<Jl 1 -■ 2 4 
■:212 * DNA 

:^li - Artificial Sequence 
•:220- 

•-22 1 • misc_feature 

■'.2 2 3 • Novel Sequence 

•:400 • 4 2. 

'jttqeaqaca tcaccatggc agee 2 4 

■:2 lo> -i;: 

• A_ i 1 - 2 4 

; : 2 • dna 

■:2l.r- /artificial Sequence 
0 ' ■ 

1' 1 ■ n.sc_feature 

'■:.'/>.' Novel Sequence 

•■ 4 00: ■ 4 2 

atgat.qctctgagtactggactgg 24 

•-.210 • 4 3 

: 1 L 2 0 

:12.- LNA 

.}''>■ Artificial Seauence 



m s c feature 
Novel Seauence 



• : 4 0 0 ' ■ 4 3 

QaaqctgtQd agagtgatgc 



:. 10 > 4 4 

ai> 24 

212 > DNA 
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2 1 3 > Artificial S c ouence 



:2 2 0 
:J21 



misc:_:eaturG 
Novel Sequence 



-MOO:- 4 4 

qtcagcadta ttgataagca gcag 2 4 

•■:210:- 4 5 

•:2il> 27 

2 > dna 

<213> Artificial Sequence 
< 2 2 0 > 

■ 22 1> nisc feature 

-'223> Novel Sequence 

M00> 4 5 

ccatggggaa cgattctgtc agctacg 2 7 

<210> 4 6 

<211> 24 

<212> DNA 

--'22 3> Artificial Sequence 
•:220> 

•■■ 221;- misc_feature 

** 2 3> Nove 1 Sequence 

*■■ 4 0 0 > 4 6 

gctatgcctg aagccagtct tgtg 24 

• ■ 2 1 0> 4 

v 2 1 1 > 2 6 

v2 12> DNA 

2 1 3'~> Artificial Sequence 

<:220> 

x22 1 > mis cofeature 

<22 3> Novel Sequence 

•MOCr- 4 7 

ccaaqatgtt gtgtcaccgt ggtggc 26 

■.1I0> 4 8 

v211 > 2 6 

- 22 2 > DNA 

<213> Artificial Sequence 

v.:.-: o> 

v 2 2 1> ni sc_f eature 

* 223> Novel Sequence 

v400r- 48 

cacagcgctg cagccctgca gctggc 26 
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.2 10 > 4 r< 
2 1. > r 

::212 - DNA 

: 2 1 3 - ArtiiiriaJ Sequence 



- 2 2 ! * m i s c_foatur e 
: 2 2 J - N o v e 1 Sequ e n c: e 



: r t c ■:: t c *.: c g tagggatgaa ccagac 



21 u ■ r > : 

2n 

2 12 ■• orJA 



artificial Sequence 



m _se_ feature 
rJ)vel Sequence 



AOO - SO 

•tcgracagg tgggaagcac ctgtgg 



■ji'i ■ si 
2:1.' 

. 212 * DNA 

:2L?-- Artificial Sequence 



22 1 ■■ msc_feature 
22^. • IJovel Sequence 



4 0 0:- 21 

cc*gtgaca gqaggtaccc tgg 

2 10> L 2 

:2 1 1 2 2 

:: : i>na 

2: ■ Artificial Seouence 



22 1- n:sc_ feature 
:22. J * Novel Sequence 



-atatccctc cgagtgt ecu qcggc 



2 : ,1 - > j 

: 2 1 1 - J- I 

2:i2- UNA 

221/- Artificial Sequence 

-221 > m::c feature 
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N o v e 1 3 equ e n c e 



: 4 0 0 > 5 3 

gcatggagaq aaaatttaty 



tccttgcaac c 



;210 • 

■:212 • 
O 1 3 • 



DNA 

Artificial Sequence 



2L 



mi sc_f ea ture 
Novel Sequence 



• A00-- 34 

caagaacaqg tctcatrtaa qagctcc 



37 




<3 l ;:> dna 

<213> Artificial Sequence 
<?2 0> 

< 221> n:sc_feature 

• 273> Novel Sequence 

v4u0:* 3T> 

uctgttgcca tgacgtccac ctgcac 26 



■:2 12^ DNA 

-2] 3> Artificial Sequence 
-220:- 

-.221> misc_feature 

22 3> l.'ovel Sequence 

^4 00> 3 6 

qgacagttca aggtttgcct tagaac 2 6 




5 6 
2 6 




DNA 

Artificial Sequence 



2 1 




ni sc_f eature 
Novel Sequence 



* 40C> 57 

ctttcgatac tgctcctatg etc 



210> 
<-211> 
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l ka 



mi s c : e L ; t u : O 



itar ccac: :: < : a a u t c c a a tqat.cc 



10 
1 1 

i :> 



1 ? • Artificial Sequence 



m i 5: c teatu r e 
t^ovol Sequence 



t j a a t. ccaac catctc 



MO 
' I 1 



una 

A r ' .: . 



1 1 Seauence 



n._ : • t ure 
N -i . • ; : u e n c e 



C t a ■ ■; 1 ■:: : 



'j ;: ■.; . :,ijqc tcttc 



: 1 1 



L'N A 
Art l 



ici.il Sequence 



22 1 



m: s c t oa t u re 
N c i v u i S e q u e n c e 



• ']()■: i 
agaa 



-q*a Uiqacccagc gtcgctcc 



21 'r- 28 
212:- DKA 

21 V- Artificial Geauence 



nisi I e a t r e 
rJovcI Seauence 
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aaag q t 1 q c g ccrragcgac a g a t g a c c 



:JIO ■ ->3 

21 1 • 22 

:2 12 - DNA 

:213- Artificial Sequence 



nisc_feature 
Novel Sequence 



•■ A 0 1"' : - 6 3 

ctq<-:acccgg acacttgctc tg 



UNA 

Artificial Sequence 



misc_f ea ture 
Novel Sequence 



<4 00> '"-1 

qtctgcttgt tcagtgccac tcaac 



2 5 



:10:.- k5 

Ml: 2 6 

M2-- DNA 

.'13-- Artificial Sequence 



riisc_f ea ture 
Novel Sequence 



■.4 0(t t"5 

tatctgcdat tctattctag ctcctg 



2 6 



-.2:iC; *b 

-111:- 2 b 

•:2 12 • LNA 

v213> Artificial Sequence 
<220:- 

v 2 2 1 * ■ mi s c_ f e a t u r e 

•22 3> Novel Sequence 



■ 4CC *.>b 

tctccctaat aaagtcacat gaatgc 



> no: 

■-' 211: 
^220:- 



I 3 
DNA 

Artificial Sequence 
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nisc teatute 
Novt-1 Soauen::t 



• A 0 0 > 

g g a g a c .1 1 7 7 a t q a a t g age c a c 



: : 1 1 ■ . 

.2 12 - LNA 

' 2 1 3 ■ Artificial Sequence 



• 1 • :nisc_f eature 
-:22 3 • Novel Sequence 



tatticaa'ig gttgtttgag taac 



. 1 0 • o ••• 

ill ■ 2 / 

J 12 ■• DNA 

.13'- Artificial Sequence 
.20 > 

J 2 I > msc feature 

.'2 3:- Novel Sequence 



* : 0 0 > 6 9 

ggcaccagtg gaggttttct gagcatg 



2" 

.12. DNA 
I;.:- Artificial Sequence 



'A ni sc_f eature 
!;"..■ Novel Sequence 



; 0 1": 

:gatgga ( jq tagaggctgt ccatctc 



2 I » ' 

212 DNA 

213 ■ Artificial Sequence 



.-21 mi sc_l" eature 
' 2 2 ; Ncvel Soauence 



■:4 00.- 7 1 

:ctgucgagc cgctagcgcc atg 

: 2 1 2 : ■ 7 2 
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: 1 i > 2 3 

J 12-* DNA 

'13> Artificial Sequence 



i -> rnisc_feature 
5> Novel Se cue nee 



A 0 0 » 2 
, 1 tqag c c j *: g ccaggccctc agt 

<21U-> 7 3 

■■:21i> 2 7 

• :212 > DNA 

■'.213:- Artificial Sequence 
■ :22U> 

'■.221:- misc_feature 

•■22 3:-' Novel Sequence 

-A00> ^3 

ctgcqatqcc cacactcaat acttctg 



•:210> 7-1 

-~211> 2' r 

'■■212> DNA 

•'213> Artificial Sequence 

•■■ 2 2 Or- 

• . 2' 1 i > 
- -i 



nisc_f eature 
2 3> Novel Sequence 



• -3 00> n A 

aaggatccta cacttggtgg atctcag 27 

v210> 7S 
-■■21 2> DUA 

'.213: Artificial Sequence 
A 0 0 > 7 S 

gctggagcat teactaggeg ag 2 2 

v210> 7 6 

<211> 2", 

- 212:> DNA 

'-21 3> Artificial Sequence 

* 2 2 0> 

221> mi sc_f eature 
<~22> Novel Sequence 

- A 00:- 7 6 

aqatcctggt tcttggtgac aatg 24 

<210..- 11 
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^ i :i - : 

i j f i a 

«_ 1 3 . A: t'.fic.d 1 . :.. • • q uo nee 

3 20 ■ 

3 3 3 Novel S equo n c e 
•3 Ci 0 ■ 

gc:Cdtcrct qccagqaaqc atgq 

3 3 0.- "?y 

3 3 2.- rja 

23 3 ■ Artificial Sequence 
.2 i): * 

32 1- :iiisc_f eature 

^13 Novel Sequence 



•1 0 1.) n 8 

cagactqtg gactcaagaa ctctagg 



."■!:• 2 f- 
. 32 ■ f'UA 

3.-.V Artificial Sequence 



: 2 ' • iiii.se feature 
:3> Novel Seauence 



•10 0 • n 9 

cr. .c<.. accaa caatgaatcc atttcatg 



.12 • nil A 
13- Artificial Sequence 



22 1 • riLs:_teature 
223 • Novel Sequen : 



A t.iy ■ Hi 

.tca*,gti;t.a gactcatggr. gatcc 



:32 ■ una 

^13* Artificial Sequence 
\">0 • 

!2: * mis :■_ feature 

■23 > Novel Sequence 



27 
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7 >7 q O ci Q 


~ n a a agCdaagqiQ ot cctcctcjc 


-.21 o> 






3 0 


12:- 


)NA 


13- 


Artificial Sequence 








misc feature 


: ^ 2 3 : ■ 


Novel Sequence 


:.;00:- 




:caqcagaac cacctttgct ttccctcccc 


■:io:- 


8 3 


211 > 


1356 


:i,..- 


UNA 


.213:- 


Homo sapiens 


4 00:- 


83 



3 0 



30 



a tggagtcct 


cacccatccc 


ccagtcat ca 


gggaactct t 


ccactttggg 


gagggtccct 


60 


caaaccccag 


gtccctctac 


tgccagtggg 


gtcccggagg 


tggggctacg 


ggatgttgct 


12 0 


t cggaatctg 


tggccct ctt 


ct tcatgctc 


ctgctggact 


tgactgctgt 


ggctggcaat 


18 0 


• j c c g < " t g t g a 


tggccgtgat 


cgccaagacQ 




y 0 □ u a l_ L. ^ ^ 




2 A 0 


:: a zzzctgcc 


tggt ggacct 


gctggctgcc 


ct gaccctca 


tgcccctggc 


catgctctcc 


3 0 0 


,ig: t ctgccc 


tctttgacca 


cgccctcttt 


ggggaggtgg 


cctgccgcct 


ctacttgttt 


3 60 


tgagcgtgt 


gctttgtcag 


cc tggccatc 


ct ctcggtgt 


cagccatcaa 


tgtggagcgc 


^20 


"actattacg 


t agtccaccc 


ca tgcgctac 


gaggtgcgca 


tgacgctggg 


gctggtggcc 


A SO 


t ctgt gctgg 


tgggtgtgtg 


ggt gaaggcc 


t tggccatgg 


cttctgtgcc 


agtgt tggga 


bAO 


agggtctcct 


gggaggaagg 


agctcccagt 


gt ccccccag 


gctgttcac t 


ccagtggagc 


bOO 


cacagtgcct 


actgccagct 


ttttgtggtg 


gtctttgctg 


tcctttactt 


tc tgt tgccc 


6 K) C 


ctgctcctca 


tacttgtggt 


ctac tgcagc 


atgttccgag 


tggcccgcg t 


ggct gccatg 


"2 0 


rag cacgggc 


cgctgcccac 


gtggatggag 


acaccccqqc 


aacgctccga 


a tct ct cage 


-so 


agccgctcca 


cgatggt cac 


cagctcgggg 


gccccccaga 


ccaccccaca 


ccggacgt 1 1 




cjggggaggga 


aagcaaaggt 


ggtt ctcctg 


gctgtggggg 


gacagttcct 


get c tgttgg 


<*0 0 


ttgccctact 


tctctttcca 


cctctatgtt 


gccctgagtg 


ctcagcccat 


ttcaactggg 


960 


caggtggaga 


gtgtggtcac 


ctgga t tggc 


tacttttgct 


tcacttccaa 


ccctttctt c 


102 0 


tatggatgtc 


t caaccggca 


gatccggggg 


gagctcagca 


agcagt tt gt 


ctget t cttc 


10 8 0 


aagccagctc 


cagaggagga 


gctgaggctg 


cctagccggg 


agggct ccat 


tgaggagaac 


114 0 


ttcctgcagt 


tcct tcaggg 


gactggctgt 


cct tctgagt 


cctgggtttc 


ccgaccccta 


1200 


cccagcccca 


agcaggagcc 


acctgctgt t 


gactt t cgaa 


tcccaggcca 


gatagctga g 


1260 
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q o a c c t. o tq aqtt c c T g 



q a a acd q c dart i' acraqcgjCd t :atcatqt c 



l :;-;'0 



: r . • . q:ccu 



c c q o c c d 



c c g acta a a c t u a :qh 




H4 
4 51 
PPT 

Homo .sapiens 



Met Glu Set Set Pro lie Pre Gin 3er Ser 31 y Asn Ser Ser Tf.r Leu 
1 5 10 15 

Gly Arq Va 1 Pro Clr. Thr Pro Gly Pro Ser Thr Ala Ser Gly Val Pro 

2 0 2 5 3 0 

Glu Val Gly Leu Arg Asp Veil Ala Ser Glu Ser Val Ala Leu Phe Pho 
3 5 4 0 4 5 

Met Leu Leu Leu Asp Leu Thr Aid Val Ala Gly Asn Ala Ala Va J Met 

50 bb 60 

Ala Val He Ala Lys Thr Pro Ala Leu Arg Lys Phe Val Phe Val Phe 

6 5 7 0 7 5 8 0 

His Leu Cys Leu Val Asp Leu Leu Ala Ala Leu Thr Leu Met Pro Lou 

6 5 90 ?b 

Ala Met Leu Ser Ser Ser Ala Leu Phe Asp His Ala Leu Phe Gly Glu 
100 105 * 110 

Val Ala Cys Ara Leu Tyr Leu Phe Leu Ser Val Cys Phe Val Ser Leu 
115 120 125 

Ala Tie Leu Ser Val Ser Ala Tie Asn Val Glu Arg Tyr Tyr Tyr Val 
130 135 140 

Va: Has Pro Met Arg Tyr Glu Val Arq Met Thr Leu Gly Leu Val Ala 
145 150 155 160 

Ser Val Leu Val Gly Val Trp Val Lys Ala Leu A^a Met Ala Ser Val 
165 17C 1^5 

Pro Val Leu Gly Arg Val Ser Trp Glu Glu Gly Ala Pro Ser Val Pro 

160 ' IPS 190 

Pro Gly Cys Ser Leu Gin Trp Ser His Ser Ala Tyr Cys Gin Leu Phe 
195 200 205 

Val Val Val Phe Ala Val Leu Tyr Phe Leu Leu Pro Leu Leu Leu lie 
210 215 220 

Leu Val Val Tyr Cys Ser Met Phe Arq Val Ala Arg Val Ala Ala Met 
22 5 230 235 24 0 

Gin His Gly Pro Leu Pro Thr Trp Met Glu Thr Pro Arg Gin Ara Ser 

24 5 2 50 " 2 55 

Glu Ser Leu Ser Ser Arg Ser Thr Met Val Thr Ser Ser Gly Ala Pro 

260 265 270 

Gin Thr Thr Pro His Arg Thr Phe Gly Gly Gly Lys Ala Lys Val Val 
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275 280 285 

Leu Leu Ala Val Gly Gly Gin Phe Leu Leu Cys Trp Leu Pro Tyr Phe 

2 9 0 2 9 5 2 0 C 

Ser Phe His Leu Tyr Val Ala Leu Ser Ala Gin Pre- lie Ser Thr Gly 

305 310 315 320 

Gin Val Glu Ser Val Val Thr Trp lie Gly Tyr Phe Cys Phe Thr Ser 
325 ' 330 335 

Asn Pro Phe Phe Tyr Gly Cys Leu Asn Arg Gin lie Arg Gly Glu Leu 
3-30 ' 345 350 

Ser Lys Gin Pne Val Cys Phe Phe Lys Pro Ala Pro Glu Glu Glu Leu 
355 360 365 

Arg Leu Pro Ser Arg Giu Gly Ser lie Glu Glu Asn Phe Leu Gin Phe 
370 375 380 

Leu Gin Gly Thr Gly Cys Pre Ser Giu Ser Trp Val Ser Arg Pro Leu 
385 390 395 400 

Pro Ser Pro Lys Gin Glu Pro Pro Ala Val Asp Phe Arg lie Pro Gly 
405 410 415 

Gin lie Ala Glu Glu Thr Ser Glu Phe Leu Glu Gin Gin Leu Thr Ser 
420 425 430 

Asp lie lie Met Ser Asp Ser Tyr Leu Arg Pro Ala Ala Ser Pro Arg 
435 440 445 

Leu Glu Ser 
450 

<210> 85 

v211> 28 

•:212 » DNA 

<213> Homo sapiens 

<4 00> 8 5 

caggaagqea aagaccacca tcatcatc 28 

<210> 8 6 

<211> 28 

<212> DNA 

<213> Homo sapiens 

<400:- 86 

catgatgatg gtggtctttg ccttcctg 28 

v210> 87 

<H> 1041 

« 212> DNA 

■'213> Homo sapiens 

v-400> 87 

atgqagagaa aatttatgtc cttgcaacca tccatctccg tatcagaaat ggaaccaaat 60 

ggcaccttca gcaataacaa cagcaggaac tgcacaattg aaaacttcaa gagagaattt 120 

ttcccaattg tatatctgat aatatttttc tggggagtct tgggaaatgg gttgtccata 1R0 
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taMrt rr:c t r.-.;::r t:: : .1 : .1 gt • c acatctgtqo L ; ; r. 1 t t ; a t : • ' : . 1 1 : • ' ; ^'40 

a v- c ci r t t c a a n: : r i\u jrt cntadacacG c:t c cct tea g a c t q li c t ^ t r a t r 1 t a g a 3 w ■ < 

ngct ■: :jat t a:jit^M taq jc-::taa::" tqC":ig^t::a t q t c t t a t o :itctaiyt i' .3 0 

.-1 a c a rqtaca q c a q t a 1 1 r. a r. t 1" c ctqac: gtg::qaq t q rt ]cqc 9 1 t t c • t g gc-.^tq 4 2 0 

7t tc ncecct r : -qqc: r ct gcat crtcac j aqca:c^qg c gt icctcj;:: gt qqq 4 8 0 

.itcar.atgga tccttatcat. qgcitcctc.i ataatcctc c t q q a c a g t g g c t c : qaq:aq 5 4 0 

aacgqcagtg tcacatcatg cttaqaqctq aat etc lata uaattgct.dd qctgeagace: 600 

a t q a a c t a t a 1 1 q c c 1 1 g g t g g t q g g c t g c ctqctuccat ttt.tcaca.ct cagcatctgt 600 

*.alc':gctga tear. tegggt ;c:qitaaad gtggaggtec cagaatcqag getoegggtt 720 

t c t c a c a g g a aqacaaagdC c a c catcat c atcaccttga rcatcttctt c':t gtgtttc 7 8 0 

e t g c o c t a c c a c a c a c t g a g gaccgtccac ttgacgacat ggaaagtqqq :t:tatgcaaa 8 4 0 

qacaoactgc ataaaqcttt ggttatcata ctggccttgg cagcagccaa t g cc t get tc 90 0 

aatcctctgc tctattactt tgctqgggag aattttaagg acagactaaa gtctgcactc 960 

agaa^aggee atcca:agaa ggcaaagaca aagtgtgttt tccctgtrag tgtgtggttg 1020 

< j g a a a g g a a a c a a q a g t. a t a a 104 1 

210 • 8 8 
211- 346 
212:- PRT 

213:- Homo sapiens 
<400:- 88 

Met l;Ju Arg Lys Phe Met Ser Leu Gin Pro Ser Tie Ser Vai Ser Giu 

1 5 10 13 

Met Giu Pro Asr; Gly Thr Pho Ser Asn Asn Asn Ser Arq Asn Cys Thr 

2 0 2b 30 

Tie Gla Asn Phe Lys Arg Giu Phe Phe Fro lie Va 1 Tyr Leu TI^ He 



4 0 4 5 



Phe Phe Trp Gly 7a 1 Leu Gly Asr. Gly Leu Ser He Tyr Val Phe Leu 
r .Q 0 3 60 

Gin Pro Tyr Lys Lys Ser Thr Ser Val Asn Val Phe Met Leu Asn Leu 

65 70 75 SO 

Ala He Ser Asp Lei; Leu Phe He Ser Thr Leu Pro Phe Arg Ala Asp 

3 5 9 0 0 5 

Tyr Tyr Leu Aig Giy Ser Asn Trp He Phe Gly Asp Leu Ala Cys Arg 

100 105 110 

He Met Ser Tyr Ser Leu Tyr Vai Asr. Met Tyr Ser Ser lie Tyr Phe 
1 1 5 1 2 0 12 5 

Leu Thr Val Leu Ser Val Val Arg Phe Leu Ala Met Val his Pro Phe 
1 3 0 1 3 5 1 4 0 



Pace 5; 



WO 01/364 "'I 



P( l /l SOO/3150<> 



Arq I .on His Val Thr Ser lie Arg Ser Ala Trp :ie Lou Cys Gly 

14 5 1 5 Ci " 155 J 160 

lie lie Trp He Leu lie Met Ala Ser Ser He Met Leu Leu Asp Set 
16 5 170 175 

Gly Ser Glu Gin Asm Gly Ser Val Thr Ser Cys Leu Gl u Leu Asn Leu 
180 185 190 

Tyr Lys Tie Ala Lys Leu Gin Thr Met Asn Tyr He Ala Leu Val Val 

195 200 205 

Gly Gys Leu Leu Pro Phe Phe Thr Leu Ser He Cys Tyr Leu Leu He 
210 215 220 

He Arg Val Leu Leu Lys Val Glu Val Pro Glu Ser Gly Leu Arg Val 
225 230 235 240 

Jor His Arg Lys Ala Lys Thr Thr He He lie Thr Leu He lie Phe 
245 250 255 

Phe Leu Cys Phe Leu Pro Tyr His Thr Leu Arg Thr Val His Leu Thr 

260 265 270 

Thr Trp Lys Val Gly Leu Cys Lys Asp Arg Leu His Lys Ala Leu Val 
275 280 285 

lie Thr Leu Ala Leu Ala Ala Ala Asn Ala Cys Phe Asn Pro Leu Leu 

290 295 300 

Tyr Tyr Phe Ala Gly Glu Asn Phe Lys Asp Arg Leu Lys Ser Ala Leu 
305 310 315 320 

Arg Lys Gly His Pro Gin Lys Ala Lys Thr Lys Cys Val Phe Pro Val 
325 330 335 

Ser Val Trp Leu Arg Lys Glu Thr Arg Val 
34C 345 



<2 1 0 > 


8 9 


<2ir> 


28 


<212> 


DNA 


<213^ 


Artificial Sequence 


< 2 2 0 > 






misc feature 


v22 3> 


Novel Sequence 


-,4 0 0> 


89 


ccagtgcaaa gctaagaaag tgatcttc 


<21 ti> 


90 


■21 1> 


28 


, ^ i 2 . 


DNA 


<-213:- 


Artificial Sequence 


< 22 0 




- 221> 


misc feature 


<223> 


Novel Sequence 



28 



< 4 0 0 > 9 0 

gaaqatcact ttcttagctt tgcactgg 
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WO 01 3(»4^1 



-210.- 

1 : • .is; -7 

• . i • 2NA 

■.2 1 S ■ H •:• ^p.en: 

• 4 2 '.' • - 1 

a t q :-icncc.: cct ccacraa cjqcacoc^: q a :i a q t. a a c a g a g :: c aca: q t. g;ar g c q q 6 0 

etc: c :: a a a a tgrccat ;ao crtgg::cac qgeatea: q c: ;;rt caaccq: g ctgqrt a t c 1 2 0 

t t cot qqceg c;:r.c:tt:gt :ggcaacata gtgctqgcac tagtgttgca qcgcaagccg I HO 

cagctg qtgc a:nga ::ad :cgttttatc tt:.aacct:c v.qgtcaccga cqtgctgcag 2A\: 

at:.: c q q t : q c c q ztg g q t g g t g g c c ac:t c : g t q c :tct ■:: 1 1 c t. g gococt :aa: 3 ) 

agcca::t:t g::acgq::ct. gqttagcctc acxac:tgt t: ;qc-:ttcgc :agcot :aac 36") 

ac:at:gL:: : g g t g t :a n t. ggategctae ttgtccatja trcaecctct ::c-:ta:ccg ■2 )i ; 
:::aagatga cccagcg::g cggttacctg ctcctctatq g::ac:tggat tgtggccat: 

ctgeaqagea ctcctcqact etaeggctgg ggccaggctg cctttgatga gegcaatget 5*10 

ctctgctcca tgatctgggg ggccagcccc agctacac:a ttctcagcgt ggtgtccttc 60 0 

at:g:ca:t : cactgattgt catgattgee tgetactreg vg^tgttctg tgcagcccgg 660 

aggcagcatg ctctgotgta caatgtcaag agaca:agoT tgjjaagtgcg agt:aaggac 7;!0 

tgtgtggaga atgaggatga aqagggagca ga gaagaagq aqgagttcca ggatgagagt 7H0 

gagttt cgee g::agca:ga aggtgaggtc aaggccaagq agggqagaat ggaaqccaag S 

gaeggcageq tqaaggccaa ggaaggaagc aeggggacca gtgagagtaq tgtagaggee 900 

agqggcagcg aggaggtcag agagagcagc acggtggccd gcgaqggcag catggagggt 9*>0 

aa'5gaagg:a qcaccaaaqz tgaggagaac ag::atqaagg cagacaaggg togqacagag 102 0 

gtcaaccag: qcaqcattqa :ttgggtgaa qatgacr.gg aq::tqgtga agarqacat-: 1 Oko 

aatttcaqtg aggatgacgt cgaggcagtg aacatcircgq agagcctc-:c acccaqzcqz 1 ;i '1 0 

cgtaacaq-a acagcaac;c tcctctgccc aqqtgctacc agtqcaaagc taagaaagtg 1200 

azczzcazca tcattttctc ctatgtgcta tcoctgggqc cctactgctt. tttagcag: : 12m!' 1 

ctggccgtqt gggtgga *~. gt cgaaacccag qtacccqaqt gggtgatcac cataatca:: • 32' 

• gqct 1 1. :.c*_ tcctgcaqt.q ctgcatccac ccctatgtct atggctacat gcacaaqacc I3H0 
attaagaaqq aaatccaqga catgetqaaq aagttcttct gcaaggaaaa qcccccgaaa 1 A A C 
qaagataqcc acc.caqaccz gcccggaaca gaqqgrggga cr.qaaggcaa gattgtccct. 15 0 0 
•.cctacgatt. ctqctacui tccttqa 2 5 2 7 

<:i0> [i2 

■ , 1 1. > 50 8 
-.212 > PRT 

■ 2 13> Home sapiens 

Faqe 5 4 
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<400> 92 

Me: Thr Ser Thr Cys Thr Asn Ser Thr Arc Glu Ser Asn Ser Ser His 
1 5 icT IS 

Thr Cys Met Pro Leu Ser Lys Me: Pro He Ser Leu Ala His Gly He 

20 25 30 

He Arg Ser Thr Val Leu Val Tie Phe Leu Ala Ala Ser Phe Val Gly 
3 5 4 0 4 5 

Asn lie Val Leu Ala Leu Val Leu Gin Arg Lys Pro Gin Leu Leu Gin 
50 55 60 

Val Thr Asn Arg Phe He Phe Asn Leu Leu Val Thr Asp Leu Leu Gin 

65 70 75 80 

He Ser Leu Val Ala Pro Trp Val Val Ala Thr Ser Val Pro Leu Phe 

85 90 95 

Trp Pro Leu Asn Ser His Phe Cys Thr Ala Leu Val Ser Leu Thr His 
100 105 110 

Leu Phe Ala Phe Ala Ser Val Asn Thr He Val Val Val Ser Val Asp 
115 120 125 

Arg Tyr Leu Ser He He His Pro Leu Ser Tyr Pro Ser Lys Met Thr 
130 135 140 

Gin Arg Arg Gly Tyr Leu Leu Leu Tyr Gly Thr Trp He Val Ala lie 
145 150 155 160 

Leu Gin Ser Thr Pro Pro Leu Tyr Gly Trp Gly Gin Ala Ala Phe Asp 
165 170 175 

Glu Arg Asn Ala Leu Cys Ser Met He Trp Gly Ala Ser Pro Ser Tyr 
180 185 190 

Thr lie Leu Ser Val Val Ser Phe He Val He Pro Leu He Val Met 
195 200 205 

He Ala Cys Tyr Ser Val Val Phe Cys Ala Ala Arg Arg Gin His Ala 

210 215 220 

Leu Leu Tyr Asn Val Lys Arg His Ser Leu Glu Val Arg Val Lys Asp 

225 230 235 240 

Cys Val Glu Asn Glu Asp Glu Glu Gly Ala Glu Lys Lys Glu Glu Phe 
245 250 255 

Gin Asp Glu Ser Glu Phe Arg Arg Gin His Glu Gly Glu Val Lys Ala 

260 265 270 

Lys Glu Gly Arg Met Glu Ala Lys Asp Gly Ser Leu Lys Ala Lys Glu 

275 280 265 

Gly Ser Thr Gly Thr Ser Glu Ser Ser Val Glu Ala Arg Gly Ser Glu 
290 295 300 

Glu Val Arg Glu Ser Ser Thr Val Ala Ser Asp Gly Ser Met Glu Gly 
305 310 315 320 

Lys Glu Gly Ser Thr Lys Val Glu Glu Asn Ser Met Lys Ala Asp Lys 

325 330 335 

Page 55 
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G 1 y A r g T h r G 1 u V a I As r i Gin C y s S e i II c A s p Leu G J y uiu As p. As p 

3 -I 0 34 5 3 50 

t G 1 1: L'he Gly Glu Asp Asp lie As ri Phe Ser Glu Asp Asp ','a 1 G i u 

.• f 3 3 6 0 3 6 5 

A I l j V a 1 As n He P ro Glu Sei Le u P r o P r c S e r A r g A r g As 1 1 Se r As n 

3 7 0 3 7 5 7 8 0 

Ser Asn Fro Pro Leu Pro Arg Cys Tyr Gin Gys Lys Ala Lys Lys Val 

3H 3 90 395 4 00 

Phe He He lie Phe Ser Tyr Vdi Lou Ser. Leu Gly Pro Tyr Cys 
4 0 5 4 10 4 15 

Fri" Leu Ala Val Leu Ala Veil Trp Va 1 Asp Val Glu Thr Gin Val Pro 

4 2 0 4 2 5 4 3 C 

■:r, 'L i r Va 1 lie Thr He lie lie Trp) Leu Pho Phe Leu Gin Cys Cys 
4 3 5 4 4 0 4 4 5 

:^ H.y P:.. Tyr Val Tyr Gly Tyr Me: His Lys Thr He Lys Lys Glu 
4 ; 4 5 5 4 60 

' H Gin A>sp> Met Leu Lys Lys Phe Phe Cys Lys Glu Lys Pro Pro Lys 

4y5 4 70 475 4 80 

Gu A 1 j Lis Pro Asp Leu Pro Gly Thr Glu Gly Gly Thr Glu Gly 

4 85 4 90 4 95 

Lv. 1H v.jl Pro Ser Tyr Asp Ser Ala Thr Phe Pro 

SO0 505 



. 1 • • Arv. r icidl Sequence 

-'221 ■ m i y .. ■ t e a t u re 
• . 2 1 3 - Nov-. 5 equence 

4 :j 0 ■ 9 3 

gccgccacog cqccaagaqg aagat tggc 29 

-21 ■ • 34 
■'.21 : • 

• : H ■ PNA 

• 21- Ait i t lcial Sequence 

■ 2 2 1 * r. _ o _• featur e 
y32 3 • Novel Sequence 

- 4 •'• • H 

gccaacct.M" ctcttggcgc ggtggcggc 2 9 

•" 2 1 0 > 5 5 

• JH • 1 C 3 2 

• 2i; • SNA 

P a a e 5 6 



WO 01/36471 
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< 2 I 3 > Homo sapiens 
<4 00> 95 



atggaccccg 


gcgaggcgct 


gctggegggt: 


ctcctggtga 


tggtactggc 


cgtggcgctg 


6 c 


cidtccciacg 


cactggtgct 


ccttrgttgc 


gcciacagcg 


ctgagctccg 


cactcgagcc 




tcaggcgtcc 


tcctggtgaa 


tctqtcqctq 


qgccacctgc 


tgctggcggc 


gctggaca t. g 


1 8 0 


ccctt cacgc 


tg c t egg t g t 


gatgegeggq 


cggacaccgt 


cggcgcccgg 


cgcatgccaa 


2 4 0 


gtc^t t gg c t 


tcctggacac 


cttcctggcg 


tccaacgcgg 


cgctgagcg: 


ggeggeget g 


30 0 


agcgcagacc 


a< ?t ggctggc 


ag tgggc 1 1 c 


ccac tgeget 


acgccggacg 


cctgcgaccg 


3 b 0 


cgctatgccg 


gcctgctgct 


gggc t g tgee 


tggggacagt 


cgctggcctt 


ctcaggcgct 


4 20 


gcacttggct 


gctcgtggct 


tggctacagc 


agegee t t eg 


cgtcctgtt c 


g:tgcgcctg 


4 8 0 


ccgcccgagc 


ctgagcgtcc 


gege ttcgea 


gec 1 1 caccg 


ccacgct cca 


tgccgtgggc 


5 4 ' ) 


ticgtgctgc 


cget ggcggt 


gctctgcctc 


acctcgctcc 


aggtgcaccg 


ggtggcacgc 


6 ( ) 0 


(jgccactgcc 


agcgcatgga 


caccgt cacc 


at gaaggege 


tcgcgctgct 


cgccgacctg 


6o0 


caccccagtg 


tgcggcagcg 


ct gcctcatc 


cagcagaagc 


ggcgccgcca 


ccgcgccacc 


7 2 0 


aggaagat tg 


gcat tgctat 


tgcgaccttc 


ct catctget 


t tgccccgta 


tgtcatgacc 


7 8C 


aggctggcgg 


agctcgtgcc 


cttcgtcacc 


gt gaacgccc 


agaagggcat 


cctcagcaag 


8 4 0 


tgcctgacct 


acagcaaggc 


ggtggccgac 


ccgtt cacgt 


actctctgct 


ccgccggccg 


90 0 


ttccgccaag 


tcctggccgg 


ca tggtgcac 


egge tgctga 


agagaacccc 


gcgcccagca 


9t-.fi 


tccacccatg 


acagct ctct 


gga tgtggcc 


ggcatggtgc 


accagctgct 


gaagagaacc 


102 0 


ccgcgcccag 


cgtccaccca 


caacggctct 


gtggacacag 


agaatgattc 


ctgcctgcag 




cagacacact 


ga 










109^ 



<210> 96 

<211> 3 63 

<212> PRT 

<213> Homo sapiens 

< 400> 96 

Met Gly Pro Gly Glu 
1 5 

Ala Val Ala Leu Leu 

20 

Ser Ala Glu Leu Arg 

35 

.Ser Leu Gly His Leu 

50 

Leu Gly Val Met Arg 

6 5 

Val He Gly Phe Leu 



Ala Leu Leu Ala Gly Leu Leu Val Met Val Leu 
10 15 

Ser Asn Ala Leu Val Leu Leu Cys Cys Ala Tyr 
25 30 

Thr Arg Ala Ser Gly Val Leu Leu Val Asn Leu 
4 0 4 5 

Leu Leu Ala Ala Leu Asp Met Pro Phe Thr Leu 

55 60 

Gly Arg Thr Pro Ser Ala Pro Gly Ala Cys Gin 
70 75 80 

Asp Thr Phe Leu Ala Ser Asn Ala Ala Leu Ser 

Paae 57 
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S 5 9 0 9 r, 

V H A 1 c i Ala Leu Ser A 1 a A s ■ G In 'I ' r p Leu A 1 a V a J. G 1 y Phe Pre L e u 

100 " 105 " 110 

A r c T y r A 1 ?i 31 y A r ci Lou A r q Pi A r g Tyr A 1 a Gly L - u L o u L - u 31 y 

i : 5 1 0 0 1 2 r ; 

Cvs Ala Trp Gly Gin Ser Lou Ala Phe Ser Gly Ala Ala Leu Gly Cys 

130 135 14 0 

Ser Trp Leu Gly Tyr Ser Ser Al i-x Phe Ala Ser Cys Ser Leu Arg Leu 

145 ISO 1SS ' 160 



Pro Pro Glu Pro Glu Arg Pre Arg 

16S 

His Ala Val Gly Phe Val Leu Pro 
180 

Leu Gin Val His Arg Val Ala Arg 
i y S 2 0 ■ J 



Phe Ala Ala Phe Thr Ala Thr Leu 

17 0 3 7 5 

Leu Ala Val Leu Gys Leu Thr Ser 
185 190 

Ser His Gys G_n Arg Met Asp Thr 

2 05 



Val Thr Met Lys Ala Leu Ala Leu Leu Ala Asp Leu His Pro Ser Val 
210 215 220 

Arq Gin Arg Cys Leu lie Gin Gin Lys Arg Arg Arg His Arg Ala Thr 

225 230 235 240 

Arg Lys He Gly lie Ala He Ala Thr Phe Leu He Cys Phe Ala Pro 

245 250 255 

Tyr Val Met Thr Arg Leu Ala Glu Leu Val Pro Phe Val Thr Val Asn 

260 265 270 

Ala Gin Lys Gly He Leu Ser Lys Cys Leu Thr Tyr Ser Lys Ala Val 

275 280 285 

A . a Asp Pro Phe Thr Tyr Ser Leu Leu Arq Arg Pro Phe Arg Gin Val 

290 295 300 

Leu Ala Gly Met Val His Arg Leu Leu Lys Arg Thr Pro Arg Pro Ala 

305 310 315 320 

Sor Thr His Asp Ser Ser Leu Asp Val Ala Gly Met Val His Gin Leu 

325 330 335 

Leu Lys Arg Thr Pro Arg Pro Ala Ser Thr His Asn Gly Ser Val Asp 

340 345 350 

Thr Glu Asn Asp Ser Cys Leu Gin Gin Thr His 

355 360 

0210:- 9" 

•-211:- 3 4 

02 12 • DNA 

ol'IG • Artificial Sequence 



■-'221 ■ mi sc_ feature 
<22?:- Novel Sequence 



'--4 00-* 9"? 

gatctctaqa atggagtcct cacccatccc ccag 34 

Paae SB 



WO 111 



P( T/l S00/3I5H9 



2 1 0 > 


98 


21 i> 


36 


'212'* 


DNA 


-■2 13.- 


Artificial Sequence 


' 2 0 '■* 




:221:- 


nasc_feature 


:223> 


Ncvel Sequence 



:4C0> 


98 


jatcqatatc 


210:- 


99 


211:- 


2610 




DNA 


2 1 3 : ■ 


Homo 


■100. 


ci 9 



36 



..tcq-.gtccr cacccatccc ccagtcatca gggaactctt ccactttggg gagggtccct 6 0 

caad(.cccag gtccctctac tgccagtggg gtcccggagg tggggctacg ggatgttgct 120 

t cggciatct; tggccctctt cttcatgctc ctgctggact tgactgctgt ggctggcaat 180 

cccgctgrq-i tggccgtgat cgccaagacg cctgccctcc gaaaatttgt cttcgtcttc 240 

cacctctgcz t.ggtggacct gctggctgcc ctgaccctca tgcccctggc catgctctcc 300 

agctcigcc-: rctttgacca cgccctcttt ggggaggtgg cctgccgcct ctacttgttt 360 

ctgaccgtg: gctttgtcag cctggccatc ctctcggtgt cagccatcaa tgtggagcgc 420 

tac:ct:a:i ugtccaccc catgcgctac gaggtgegea tgacgctggg gctggtggcc AHO 

tctg:gctn tgggtgtgtg ggtgaaggcc ttggccatgg cttctgtgcc agtgttggga 540 

agggtctcct gggaggaagg agctcccagt gtccccccag gctgttcact ccagtggagc 6u0 

cacaatgccr actgccagct ttttgtggtg gtctttgctg tcctttactt tctgttgccc 660 

ctgztcctca tacttgtggt ctactgcagc atgttccgag tggcccgcgt ggctgccatg 720 

cag:acgggc cgctgcccac gtggatggag acaccccggc aacgctccga atctctcagc lh'0 

agccgctc:a cgatggtcac cagctegggg gccccccaga ccaccccaca ccggacgttt 84 0 

99gggagggu aagcagcagt ggttctcctg gctgtggggg gacagttcct gctctgttgg 900 

ttgccctac*. tctctttcca cctctatgtt gecctgagtg ctcagcccat ttcaactggg 9^0 

caggtggaga gtgtggtcac ctggattggc tacttttget tcacttccaa ccctttcttc 1020 

tatggatgtc tcaaccggca gatceggggg gagctcagca agcagtttgt ctgcttcttc 10f0 

aagccagctc cagaggagga gctgaggctg cctagccggg agggctccat tgaggagaac 1240 

ttcctgcagt tccttcaggg gactggctgt ccttctgagt cctgggtttc ccgaccccta 1200 

cccagcccca ageaggagee acctgctgtt gactttcgaa tcccaggcca gatagctgag 1260 

gagacctctg agttcctgga gcagcaactc accagcgaca tcatcatgtc agacagctac 1320 
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•at ccgt c rtc ccqrctc.ic c c c q q a t. q g a q tcaqcgaiat c t. q r q a a t t , • c a r :: a c a c t 1 3 a 1 1 

qqacti'jg ig:; at "cgaocic q q t a a o a a q a t t q q g c t q c a q g Z c g a t o c: q c t a ;; a t c a q a 1^4 0 

: a aaq * zt. q a a c q a g q a . : a a q c g a a u c q a qaqaagqcgc a q c q c g a a o a c a u c a a a a a q la 0 0 

a t c g a g a c q a ac:tgcagaa q g a c a , a qcag qtctaccggg c a a c q c a a a g ccrqctgc: q la 6 o 

c t g q g t q c t q gaqagtctgg a a a a g c a c a a 1 1 g t g a age a gatcaqqat car a a a t g 1 1 1 6 a 0 

a at gggttta acqgagaggg eggeqaagag gac c eg ca gg atgeaaggag aaacagegat lhoC 

gqtgagaagg ccaccaaagt q:agga:ata: aaaaacaacc tgaagqagge cattgaaacc 1" A') 

attgtggcag aca tgagcaa a:tggtgccc cccgtggagc tggccaa;-:c tgagaaccag 1 0 0 

tt3agagtgq aciacattct qagcqtgatg aacgtgccaa actttqaatt c:cacctgaa IK 6 0 

ttctatgag : atgecaagge tctgtgggag gatgagggag Ltcgtgc ::tg ctacgagcqc lOla 

raacaacgagi aecagctgat cgactgtgcc cagtacttcc tqgacaagat tgatgtgatc I 'HHO 

aagaaggc-ag actacgtgcc aagtgaccag gacctgcttc getgecgagt >a:tgac:tc: 2 04 0 

ggaatctttg agaccaagtt ccaggtggac aaagtcaact tccacatgtt cgatgtgggc 1100 

ggccagcgcg atgaacgccg caagtggatc cagtgettea atgatgtgac tgccatcatc 2160 

ttcgtggtgg ceagcagoag at aca a cat g gtc^tccggg aggacaa rca gaccaaccgt '1:22 0 

ct gcaggagg ccctgaacct cttcaagagc atctggaaca aoagatggct gagtaccatc 1": Z 8 0 

tctgtgatac tcttcctcaa caagcaaga: ctgcttgctg agaaggtcct cgctgggaaa 2 2 -in 

tcgaagattg aggactactt tccag agate gctcgctaca ccactccrga ggatqegact I-IOO 

aacgagcccg gagaggaccc acgcgtgacc egggecaagt acttcatccg ggatgagttt 1*560 

;~ gaga a tea geactgetag tggagatgga cgteactact qctaccctca ctttacctgc 152 0 

qccgtggaea ctgagaacat ccgccgtgtc ttcaacgact geegtgacat catccagcgc 1580 

a* goat ct to gecaataega gctgctciaa 2 610 

• : : o • 100 

-.11- 869 
2 12 > FRT 
1 3 • Homo sapiens and Rat 

•10 0 • I'M) 

Me t G 1 u S or S *a r Pro 1 1 c Pro Gin S e r 3 e r G 1 y Asa Ser Ser T h r Leu 
1 5 10 1 5 

Giy Arg Yal Pro Gin Thr Pro Gly Pre Ser Thr Ala Ser Gly VaJ Pro 

2 0 2 5 30 

Glu Val Gly Leu Arg Asp Vai Ala Ser Glu Ser Val Ala Leu ?he Phe 
3 5 " 4 0 4 b 

Met Leu Leu Leu Asp Leu Thr Ala Val Ala Gly Asn Ala Ala Val Met 
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Ala Val lie Aid Lys T'::r Pro Ala Leu Arg Lys Phe Val Pho Val Phe 

61 ^ i. 1 7 b H C 

His Leu Cys Leu Val Asp Leu Leu Ala Ala Leu Thr Leu Met Pro Leu 

8 5 9 C 9 5 

Ala Met Leu Ser Ser 3er Ala Leu Phe Asp His Ala Leu Phe Gly Giu 
100 105 110 

Val Ala Cys Arc Leu Tyr Leu Phe Leu Ser Val Cys Phe Val Ser Leu 
115 120 ' l.'S 

Ala lie Leu Ser Val Ser Ala lie Asn Val Glu Arg Tyr Tyr Tyr Val 

130 135 140 

Val His Pro Met Arg Tyr Glu Val Arg Met Thr Leu Gly Leu Val Ala 
145 150 155 160 

Ser Val Leu Val Gly Val Trp Val Lys Ala Leu Ala Met Ala Ser Val 
165 170 175 

Pro Val Leu Gly Arg Val Ser Trp Glu Glu Gly Ala Pro Ser Val Pro 
180 185 190 

Pro Gly Cys Ser Leu Gin Trp Ser His Ser Ala Tyr Cys Gin Leu Phe 
195 200 205 

Val Val Val Phe Ala Val Leu Tyr Phe Leu Leu Pro Leu Leu Leu lie 

210 215 220 

Leu Val Val Tyr Cys Ser Met Phe Arg Val Ala Arg Val Ala Ala Met 

225 230 235 240 

Gin His Gly Pro Leu Pro Thr Trp Met Glu Thr Pro Arg Gin Arg Ser 
245 250 255 

Glu Ser Leu Ser Ser Arg Ser Thr Met Val Thr Ser Ser Gly Ala Pro 
260 265 270 

Gin Thr Thr Pro His Arg Thr Phe Gly Gly Gly Lys Ala Ala Val Val 
275 280 285 

Leu Leu Ala Val Gly Gly Gin Phe Leu Leu Cys Trp Leu Pro Tyr Phe 

290 295 300 

Ser Phe His Leu Tyr Val Ala Leu Ser Ala Gin Pro lie Ser Thr Gly 
305 110 315 320 

Gin Val Glu Ser Val Val Thr Trp lie Gly Tyr Phe Cys Phe Thr Ser 
325 330 335 

Asn Pro Phe Phe Tyr Gly Cys Leu Asn Arg Gin lie Arg Gly Glu Leu 
340 345 350 

Ser Lys Gin Phe Val Cys Phe Phe Lys Pro Ala Pro Glu Glu Glu Leu 
3 55 3 60 3 65 

Arg Leu Pro Ser Arg Glu Gly Ser lie Glu Glu Asn Phe Leu Gin Phe 
370 375 380 

Leu Gin Gly Thr Gly Cys Pro Ser Glu Ser Trp Val Ser Arg Pro Leu 

385 390 395 ~ 400 

Pro Ser Pro Lys Gin Glu Pro Pro Ala Val Asp Phe Arg lie Pro Gly 
405 410 415 
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Gin Tie Ala olu Glu Tnr Ser Glu Phe Leu Glu Gli. i ! :. Leu Thr 3e: 

4 2 0 4 2 5 4 3 0 

Asp lie lie Met ;>i Asp Ser Tyi Leu Arq F r o Ala Ala Jor Pre Arc 

4 3 0 4 4 0 " 4 4 5 

Lev: 31u Ser Ala Tie Ser Aid Glu Phe His His Thr Gly Leu Vai Asp 

4 50 4 55 4 60 

P r o Ser S e r V a 1 P r o S e r Leu Gly Cys A r g Ser Met G 1 y C y s Leu G 1 v 

465 470 475 " 480 

Asn Ser Lys Thr Glu Asp Gin Arq Asn Glu Glu Lys Ala Gin Arg Glu 

4 85 4 90 4 95 

Ala Asn Lys Lys lie Glu Lys Gin Leu Gin Lys Asp Lys Gin Val Tyr 

500 505 510 

Arq Ala Thr His Arg Leu Leu Leu Leu Gly Ala Gly Glu Ser Gly Lys 

515 520 525 

Ser Thr lie Val Lys Gin Met Arg Tie Leu His Val Asn Gly Phe Asn 

530 535 540 

Gly Glu Gly Gly Glu Glu Asp Pro Gin Ala Ala Arg Ser Asn Ser Asp 

545 550 555 560 

Gly Glu Lys Ala Thr Lys Val Gin Asp lie Lys Asn Asn Leu Lys Glu 

565 570 575 

Ala lie Glu Thr He Val Ala Ala Met Ser Asn Leu Val Pro Pro Val 

580 585 590 

Glu Leu Ala Asn Pro Glu Asn Gin Phe Arg Val Asp Tyr I 1. e Leu Ser 

595 600 ' 605 

Val Met Asn Val Pro Asn Phe Asp Phe Pro Pro Glu Phe Tyr Glu His 

610 615 * 620 

Ala Lys Ala Leu Trp Glu Asp Glu Gly Val Arg Ala Cys Tyr Glu Arq 

625 630 635 640 

Ser Asn Glu Tyr Gin Leu He Asp Cys Ala Gin Tyr Phe Leu Asp Lys 

645 65 0 65 5 

He Asp Val He Lys Gin Ala Asp Tyr Val Pro Ser Asp Gin Asp Leu 

660 665 670 

Leu Arg Cys Arq Val Leu Thr 3er Gly He Phe Glu Thr Lys Phe Gin 

6~ 7 5 680 685 

Val Asp Lys Val Asn Phe His Met Phe Asp Val Gly Gly Gin Arg Asp 

690 695 700 

Glu Arg Arg Lys Trp lie Gin Cys Phe Asn Asp Val Thr Ala He He 

7 0 5 " ' 710 715 7 2 0 

Phe Val Val Ala Ser Ser Ser Tyr Asn Met Val lie Arg Glu Asp Asn 

7 2 5 7 3 0 7 3 5 

Gin Thr Asn Arq Leu Gin Glu Ala Leu Asn Leu Phe Lys Ser He Trp 

7 4 0 7 4 5 7 5 0 



Asn Asn Arg Trp Leu Arg Thr He Ser Val He Leu Phe Leu Asn Lys 
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"55 7 60 ie c , 

Gin Asp Leu Leu Ala Glu Lys Val Leu Ala Gly Lys Ser Lys He Glu 

7 7 0 7 7 5 n 8 J 

Asp Tyi Phe Pro Glu Phe Ala Arg Tyr Thr Thr Pre Glu Asp Ala Thr 
765 790 " " 795 800 

Pri Glu Pre Gly Glu Asp Pro Arg Val Thr Arg Ala Lys Tyr Phe He 

805 810 815 

Arg Asp Glu Phe Leu Arg He Ser Thr Ala Ser Gly Asp Gly Arg His 
820 825 " 830 

Tyr Cys Tyr Pro His Phe Thr Cys Ala Val Asp Thr Glu Asn He Arg 
835 840 845 

Arg Val Phe Asn Asp Cys Arg Asp Tie He Gin Arg Met His Leu Arg 
S50 ' 855 860 

Gin Tyr Glu Leu Leu 
8 65 



<2 


10 > 


101 




11> 


30 


<z 


12> 


DNA 


< ' 


13> 


Artificial Sequence 




2 0> 






2 1 > 


misc feature 


< J_ 


23> 


Novel Sequence 


<4 




101 


tc 


tagaatga cgtccacctg caccaacagc 


<2 


1C)> 


102 




i:> 


34 






DNA 


< j 




Artificial Sequence 


<^ 


2 0> 




<2 




misc feature 


< ^ 


23> 


Novel Sequence 



30 



<400> 102 

gatategcag gaaaagtagc agaategtag gaag 34 

<H0> 103 
<211> 2781 
<212> DNA 

<213> Homo Sapiens and Rat 
<400> 103 

atgaegtcca cctgcaccaa cagcacgcgc gagagtaaca gcagccacac gtgcatgccc 60 
ctctccaaaa tgcccatcag cctggcccac ggcatcatcc gctcaaccgt gctggttatc 120 
ttcctcgccg cctctttzgt eggcaacata gtgctggcgc tagtgttgca gcgcaagccg 180 
cagctgctgc aggtgaccaa ccgttttatc tttaacctcc tcgtcaccga cctgctgcag 240 
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ci V. 1 r. e g <: t c: g r ggceceerg gqr ggt g g c: t ic:.tctn:g c ctctcrtcta g c c c c t. c a a o 3 G 0 

acjccdcrtct qcacgqccrt q^taqcctc: a:ccacc:gt tcq:::rcqc cagcqLcaac j^i 

accattqtc: r.ggtgtcagt gpatcnctac rtqtcca:ca :ccaocctct ctccta:ccq 4 1 0 

t.ccaa:atqa cocaqeqecg c^cttacctq ctcct:tar.:] qcacctggar tgrqqccatc 4-^0 

c t qca •.: age a c i:c:ccact c:t.accqctgg g g e c a -g g e t g e c r z ■; . g a t g a qcgcaa: g c r .:■ 4 v j 

cretget. c c a r. g a t c t g g g g gjccaqcccc agcta:ac: a r t c r. ; a q eg t g g t g t ccttc n ' » 0 

at'jgt^attc ca:tgattgt catgattgee tgctactccg Lggrgrrcrg tgcagcccgg 6»)0 

aggcagcatg cretgctgta caatgtcaag agaca eager rggaagtgcg agtcaaggac -'2 0 

tqtgtgqaga atgaggatga agagggagca gagaagaagg aqgagttcca ggatgagagr 7 HO 

gagtrtegee geeageatga aggtgaggtc aaggc:aagg agggeagaar ggaagccaaq H40 

^acqgoaqcc tgaa3?::aa qqaaqqaaqc acqqqqa::a qtqaqagtac tgtagaggc: ^ >■'! 

agggg^agcg aggaggteag aga gag rage aeggtggeea gegaegqcag catggagggt 

aaggaaggca geaccaaagt rgaggagaac agcargaagg eagaeaaggg tegcacagag 1020 

gtcaaccagt gcagcatrga cttgggrgaa gatgaeargg agrrrggega agacgacat: 10*0 

aa:ttcagtg aggatgaegt ogaggcagtg aacatecegg agagcctccc acccagtcgt 1140 

egraaeagca acagea aecc rcctctgccc aggrgeraee agrgeaaagc tgctaaagtg 11' 00 

atetteatca tcattrtetc ctatgtgcta tccctgggge cctactgctt tttagcagt: 12 ml 

erggcegrgr gggtggargt cgaaacecag gtaccecagt gggtgatcac cataatcatc IV'0 

rggcttztct tcctg:agtg etgcatccac ccctaigtct atggctacat geacaagace 1.380 

attaagaagg aaatc:agga ~atgctgaag aaqtt:ttct geaaggaaaa gecceegaaa I*;*;'"' 

gaagatagee acccagacct geceggaaca gagggrggga ctga.iggcaa gattgtccct 100L' 

rccraegatr crgcraerrt tectgegata tetgeagaat recaccacac tggacragtg 15*".0 

gatccgagct eggtaccaag crtgggetgc aggtcgargg gregeetegg caacagtaag 1 o. o 

aecgaggaec agegcaaega qgagaaggcg cagegeqagg c:aacaaaaa gatcgagaaq lu>-0 

eagetgeaga aggaeaagca egtctaeegg gecacgeaee girctgetget gctgggtgct I" : 4l 

gqagagtctg qeaaaagcac cattgtgaag cagatgagga rectaeatgt taatgggtrt 1 ■ ' > : 

aaeggagagg qegqegaaga ggacccgcag getgeaagoa geaaoagega tggtgagaag IS 

qccaccaaag tgcaggacat eaaaaaeaac ctgaaggagg cc atrgaaac cattgtggcc 1'?1C 

geeatgagca acctggtqcc ceecgrggag ctggceaacc ctgagaacca qrtcaqagtg I'^O 

gacta :.3t:c tqagcgtgat gaacgtgcca aactttgact tcccacctqa attctatgag 20 4 0 

catgc:aagg ctcrgtggga ggatgaggga gttcgtgcct getaegageg ctccaacgag 2 1 ML 

raecagetga tcgactgtgc c:agtacttc ctggacaaga trgatgtgat caagcaggcc 2^*50 

gaetaeqtqc caagtqacca qqacctqett cqctqccqcq tcctqacctc tqqaatcrtt 22 2 0 

Paqe 64 
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g a g a o caagt tccaqgrgga caaagt caac t tccaca tgt t cgatgtqgg cqacc:±acgc 2 

aatgaacgcc gcaagtggat ccagtgcttc aatgatgt^a ctqccatcat cttcgtgytg 2 340 

gccagcagca gctacaacat ggtcatccgg gaggacaacc agaccaaccg tctqcaqgag '.'4 00 

cctcrgaacc tcticaagag catctggaac aacaga:ggc tgcgtaccat ctctcr. qatc 2 4 60 

r.ct^cctca acaagcaaga tctgcttgct gagaaggtcc tcgctgggaa atcgaagat: 2S2C 

gaggactact rtccagagtt cgc'cgctac accactcctg aggatgcgac tcccgagccc 2 58 0 

ggagaggacc cacgcgtgac ccgggccaag tacttcatcc gggatgagtt tctgagaatc 2640 

agcactgcta gtggagatgg acgtcactac tgctaccctc actttacctg cgccgtggac 2700 

d..:tgagaaca tccgccgtgt cttcaacgac tgccgtgaca tcatccagcg catgcatctt ^760 

cgccaatacg agctgctcta a 2781 

: 2 10.- 104 
2:11> 926 
<212> PRT 

<213> Homo sapiens and Rat 

•:400> 104 

Met Thr Ser Thr Cys Thr Asn Ser Thr Arg Glu Ser Asn Ser Ser His 
15 10 15 

Thr Cys Met Pro Leu Ser Lys Met Pro lie Ser Leu Ala His Gly lie 
20 25 30 

He Arg Ser Thr Val Leu Val lie Phe Leu Ala Ala Ser Phe Val Gly 
35 40 45 

Asn Tie Val Leu Ala Leu Val Leu Gin Arg Lys Pro Gin Leu Leu Gin 

5 0 5^ 60 

Val Thr Asn Arg Phe lie Phe Asn Leu Leu Val Thr Asp Leu Leu Gin 

65 70 75 80 

lie Ser Leu Val Ala Pro Trp Val Val Ala Thr Ser Val Pro Leu Phe 
85 90 95 

Trp Pro Leu Asn Ser His Phe Cys Thr Ala Leu Val Ser Leu Thr His 
100 105 110 

Leu Phe Ala Phe Ala Ser Val Asn Thr lie Val Val Val Ser Val Asp 
115 120 125 

Arg Tyr Leu Ser lie lie His Pre Leu Ser Tyr Pro Ser Lys Met Thr 
130 135 140 

Gin Arg Arg Gly Tyr Leu Leu Leu Tyr Gly Thr Trp He Val Ala He 
145 150 155 160 

Leu Gin Ser Thr Pro Pro Leu Tyr Gly Trp Gly Gin Ala Ala Phe Asp 
16S 170 175 

Glu Arg Asn Ala Leu Cys Ser Met He Trp Gly Ala Ser Pro Ser Tyr 
180 185 190 
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Tiii II*-.- Leu Ce: Val Val Se: Pht- Tie Val lie Pro Leu II*.. Veil M-t 

195 :'0u 2C5 

lit Ala Cys Tyr Ser Val Val Phe Cys Ala Ala Arq Arc: Gin Hi? Ala 

210 ' 215 220 

Leu Leu Tyr Asn Val Lys Arc: :iis L'or Leu Glu Val Arc: Val Lys Asp 

22: ' 2 30 2 3 5 2 4 0 

■"y.s Val G^u Asn Glu Asp Clu Glu Gly Ala Glu Lys Lys Glu Glu F'he 
245 250 255 

31. a Asp Glu Ser Glu Phe Arg Arg Gin His Glu Gly Glu Val Lys Ala 

2 6C 2 05 270 

Lys Glu Gly Arg Met Glu Ala Lys Asp Gly Ser Leu Lys Ala Lys Glu 

2 7 5 2 ^ 0 2 8 5 

G 1 y Ser Thr Gly T h r S e r Glu Ser S o r Val Glu Ala Arg G 1 y Sec G 1 u 

'2 90 2 9b 3 00 

Glu Val Arq Glu Ser Ser Thr Val Ala Ser Asp Giy Ser He: Glu Gly 

30.o 31 0 31 5 3 2 0 

Lys Glu Gly Ser Thr Lys Val Glu Glu Asn Ser Met Lys Ala Asp Lys 
325 330 335 

Gly Arq Thr Glu Val Asn Gin Cys Ser lie Asp Leu Gly Glu Asp< Asp 

34 0 34 5 3 50 

Met. Gl u Phe Gly Glu Asp Asp lie Asn Phe Ser Glu Asp Asp Val Glu 

3 5 5 3 00 3 b 5 

Ala Val Asn lie Pro Glu Ser Leu Pro Pro Ser Arc Arg Asn Ser Asn 

.3 "7 0 3 7 5 3 8 0 

Ser Asn Pro Pro Leu Pro Arg Cys Tyr Gin Gys Lys Ala Ala Lys Val 

385 390 395 400 

lie Phe lie He He Phe Ser Tyr Val Leu Ser Leu Gly Pro Tyr Cys 
405 410 415 

Phe Leu Ala Val Leu Ala Val Trp Val Asp Val Glu Thr Gin Val Pro 

420 425 ' 4 3C 

Gin Trp Val He Thr He lie lie Trp Leu Phe Phe Leu Gin Cys Cys 

4 3 5 4 4 0 4 4 5 

He His Pro Tyr Val Tyr Giy Tyr Met His Lys Thr He Ly.: Lys Glu 

4 5 2* 4 5 5 4 60 

He Gin Asp Met Leu Lys Lys Phe Phe Cys Lys Glu Lys Pro Pro Lys 

465 470 475 480 

31 u Asp Ser His Pro Asp Leu Pro GJy Thr Glu Giy Gly Thr Giu Giy 
4 85 4 90 4 95 

Lys He Val Pro Ser Tyr Asp Ser Ala Thr Phe Pro Ala He Ser Ala 

500 505 510 

Glu Phe His His Thr Gly Leu Val Asp Pro Ser Ser Val Pro Ser Leu 

515 52 0 52 5 

Giv Cys A.rq Ser Met Gly Cys Leu Gly Asn Ser Lys Thr Glu Asp Gin 

530 ' ' 535 540 
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Asn Glu Glu Ly.s hi ,1 Gin Arg Glu Ala Asn Lys Lvs lie Gju Lys 
54 5 550 ' 555 " 56u 

Gin Leu Gin Lys Asp Lys Gin Veil Tyr Arg Ala Thr His Arg Leu Leu 

565 570 ' 575 

Leu Leu Gly Ala Gly Glu Ser Gly Lys Ser Thr lie Val Lys Gin Met 

580 555 590 

Arg lie Leu His Val Asn Gly Phe Asn Gly Glu Gly Gly Glu Glu Asp 
595 600 605 

Pro Gin Ala Ala Arg Ser Asn Ser Asp Gly Glu Lys Ala Thr Lys Val 
610 615 620 

Gin Asp lie Lys Asn Asn Leu Lys Glu Ala lie Glu Thr lie Val Ala 
62.5 630 635 640 

Ala Met Ser Asn Leu Val Pro Pro Val Glu Leu Ala Asn Pro Glu Asn 

645 650 655 

Gin Phe Arg Val Asp Tyr lie Leu Ser Val Met Asn Val Pro Asn Phe 
660 665 670 

Asp Phe Pro Pro Glu Phe Tyr Glu His Ala Lys Ala Leu Trp Glu Asp 

675 680 685 

Glu Gly Val Arg Ala Cys Tyr Glu Arg Ser Asn Glu Tyr Gin Leu lie 
690 695 700 

Asp Cys Ala Gin Tyr Phe Leu Asp Lys lie Asp Val lie Lys Gin Ala 
705 " 710 715 720 

Asp Tyr Val Pro Ser Asp Gin Asp Leu Leu Arg Cys Arq Val Leu Thr 

725 730 ~ " 735 

Sex Gly lie Phe Glu Thr Lys Phe Gin Val Asp Lys Val Asn Phe His 
740 745 750 

Met Phe Asp Val Gly Gly Gin Arg Asp Glu Arq Arg Lys Trp lie Gin 
755 760 765 

Cys Phe Asn Asp Val Thr Ala lie lie Phe Val Val Ala Ser Ser Ser 
770 775 780 

Tyr Asn Met Val lie Arg Glu Asp Asn Gin Thr Asn Arg Leu Gin Glu 

785 790 795 800 

Ala Leu Asn Leu Phe Lys Ser lie Trp Asn Asn Arg Trp Leu Arg Thr 

805 810 815 

lie Ser Val lie Leu Phe Leu Asn Lys Gin Asp Leu Leu Ala Glu Lys 
820 825 830 

Val Leu Ala Gly Lys Ser Lys lie Glu Asp Tyr Phe Pro Glu Phe Ala 
835 840 845 

Arg Tyr Thr Thr Pro Glu Asp Ala Thr Pro Glu Pro Gly Glu Asp Pro 
850 855 860 

Arg Val Thr Arg Ala Lys Tyr Phe lie Arg Asp Glu Phe Leu Arg lie 

865 87C 875 880 



Ser Thr Ala Ser Gly Asp Gly Arg His Tyr Cys Tyr Pro His Phe Thr 
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Cys Ala Val Asp Thr GIu As:; lie Arg Arg Val Phe Asn Asp Cys Arg 

9 C L 1 9 0 5 1 J • 

A G p I Le 1 .1 e Gin Arq Met iii.s Leu Arg Gin Tyr Glu Leu Leu 

915 9 2 0 92b 

•• 210 • 105 

: 2 .1 2 : ■ DMA 

w 1 3 • Artificial Sequence 
220- 

221:- misc_feature 

223:- Novel Sequence 

.4 00 ■ 1 05 

1 1 g * a t gcc agcgt cc t gc tec 2 3 



] 2 2)NA 

13 Artificial Sequence 



.21 mi sc feature 
■ ^ <_ Novel Sequence 

_j 1 t a * g c - - r ^ a a g c c a g t c t t g t g 



.SI ():■ 10V 

::il 2"> 

:;" ; 2 : ON A 

; 13 ■ Ar uLcial Sequence 
■120 ■ 

3 1 .1 * • iro :sc _f ea t ure 

. 1 , 3 N o v e 1 S e q u e r; c e 



400 107 
jccicrtqctc ctgagcacct tctcc 

.2I0-- 109 



DNA 

Artificial Seqeunce 



•20 • 

:2i> n: sc_ feat ure 
12 3"- Novel Sequence 



:'f00:- 109 

i-acacKiqctq caqccctgca gctggc 2 6 



2 1 0 > 10 9 
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21 
DNA 

Artiticifii oe-que n c e 




mi sc_featurc 
Novel Sequence 



• 4 00 



10 9 

g a t g a ct^tgtccag cctg 



24 



■2 10.- 110 

• 211 ■ 21 

■ 212 DNA 

•213'- Artificial Sequence 
2 2 0 - 

• 221 ■ m.LSC_f eature 
22 3' Novel Sequence 

• 'J 0 0- 120 

cdqacacttg gcagggacga ggtg 2 1 

-210/ 111 

<211 ■ 20 

2 12- DNA 

'2 i V- Artficial Sequence 



* ^10-' 112 
^'2 II ■■ 21 
*.2'12> DNA 

■ 212 * Artif icial Sequence 
'■.220> 

- 221 • misc_f eature 

• 222 * Novel Sequence 

*■ 4 00 • : 12 

cataiccctc cgagtgtcca gcggc 25 

• 2 1C- 213 
< j 1 1 ■• 2 4 

■ 212 > DNA 

*-213> Artificial Sequence 
<.2 2 0> 

**221> mi sc_f eature 

* 223'.- Novel Sequence 



0 



nu sc_f eature 
Novel Sequence 



•■ 4 0 0 > 1 1 1 

c t rgtggt ct actgcagcat grtccg 



26 
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■ A 0 ' ")> 11 ..s 

■iraiatcct: urrarcrt: crrc ,' -1 

• ^ x : • ll'i 

< 2 I 1 • 2 7 

:212 ■ UNA 

2 i 3 • Arti t l ridl Sequence 

2 2 0 - 

22 1 - rr.isc_rtratuie 

2 2 3- I J c v e 1 S e q u e ri c e 

-100> .114 

raaqa acagq tctca tctac gaqct cc 71 

210- i 15 

211- 26 

2 1 2 ■• DNA 

.' ] i • M.r 1 i 1 1 c l a 1 Sequence 

:22!'- mi sc_f ea t u re 

•'.2 2 3*' Novel Sequence 

■100-* 115 

■:tci.qatgcc atctgctgga ttcctg 2 6 

: 2 1 0 - 12 6 

• 2 1. ■ 2 6 
■212 ■ DNA 

- . j 1 -i • Artificial Sequence 
■ : 2 2 0 ' 

■ 2 2 J • ni sc_feature 

■ . 2 2 v ■ N o v el Sequcn c e 

• A 0 0^ 116 

atdutccact gaaagtccag tqar.cc 2 6 

• j i 1 17 
-.211:- 2-' 
•2i;- DNA 

• 2 1 .'■ ■ Artificial Sequence 
• : 2 2 1 1 '* 

-.22 1> misr_feature 

■:J2J"- Novel Sequence 

•:.10n> 117 

vijgrgqcqat ngccaacdgc gctc 2 4 

< 2 1 iv • 118 

< 2 : 2 • DNA 

<2!'*s Artificial Sequence 
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• 2 2 0 > 

2 21 > ~ i sc_toature 

■ - .'. 3> Novel Sequence 

■ 4\'~0 ■ 1 IB 

a t a :gcct*; acjcgacagat gacc 2 4 

■ J_ . 0 ■ 119 

•211 23 

• : 2 • dna 

• 212 • Artificial Sequence 

•■ 2 : c > 

■ 23 1'* misc_feature 
•32 3'- Novel Sequence 

• -1 0 0 119 

• ..iici.^tata :agcagcatc etc 2 3 

- . 1 C • 12 3 

■ _ i 1 2 3 
3J2 ■ DNA 

- 1 3 ■ Artificial Sequence 

• 0 

■ 1 ^ - c:_: ea t ure 

• 3 3 N _ vei Seauence 



ad a j t .•: gc agaatggtta gec 



12. 



-■2 13 Artificial Sequence 
-320 • 

•-_21 * niisc_ieature 

-■■22 3 Novel Sequence 



<A00 • 121 

qaca-/ ::*_gtc ageggtegtg tgtg 



<210- 12: 
■-J 11:- 27 



<- 2 2 0 
<221 
<223 



Artificial Sequence 



m:sc_feature 
Novel Sequence 



<400> 122 

ctgatggaag tagaggctgt ccatctc 
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21 1 



A i L i c l ii 1 S e a \ i ^ n c * 



nisc_f e a t urc 
^ovel S*.;que:ic 



•; 0 0 • 12 3 
gcigaocg c a g a c c a q t g qctq 



; 4 



^ I 0 > 12 4 

2 li- i a 

112 • DNA 

1 3 - .A rtif icidl Sequence 

^20 ■ 

_2 1 • nusc_f ear. ure 

j 2 J ■ novel Sequence 



400 ■ 124 

icggtgacg aagggcacga gctc 



24 



j .1 

DC A 

Art if ic 



ial Seauence 



: 2 1: 
'"2 -2 



m: s c_:e a r. ure 
Novel Seauence 



gcr.jtccct. gccaggaaqt: atqg 



lo: 126 

. 1 i : • ^2 

.12" DNA 

. 1 .V Artificial Seauence 



.2 1- n\: sc_f ea t. ure 
2 ■ Novel Seauence 



■naatagat gtqcagcaca atggc 



iA' " 1 . 

11: 2 : 



zi.- 2>na 

13 Artificial Sequence 



:21> mi sc_ feat ure 
l23-- Novel Seauence 
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-4 00> 12' 7 

' *.: < j c t c a a c a qcgctggttg g c a a c 



• 2 .0> 


128 


• 2 11> 




^ - ' 


DNA 


' I. J ■ 


Artificial Sequence 


• 2J0:- 

•■ ;: ::i • 


m:sc feature 


• : 2 ' ^ > 


Novel Sequence 


■■4mC> 


128 




tcta gacicdtggt gatcc 


• 2 1 0> 


129 


■" 2 i 1 : • 


6 


- 212 




• 2 . 3.- 


Artificial Sequence 


- ::o..- 




< 2 2 1 > 


mi sc_f eature 


<223 > 


Novel Sequence 



-■ 4 00> 


1 2 9 


Tht L 


eu Glu Ser lie Met 

5 


0> 

^ : : i > 
-.: i 2> 
O i 3> 


130 
PPT 

Artificial Sequence 


■-220' 
< 2 2 1 > 
<2.;3> 


misc feature 
NC'Vel Sequence 


<10 0> 


130 


Glu Tyr Asn Leu Val 

1 5 


--21 0> 

< 2 1 1 > 

< 2 1 2 > 

< 2 1 3 > 


131 

5 

PFT 

Artificial Sequence 


<220> 
< 22 1:- 
<223> 


misc feature 
Ncvei Sequence 


< 4 00> 


131 


Asp C 
1 


ys Gly Leu Phe 

5 


<210> 


132 



2 5 



Page 73 



WO 0 1 3<»-P I 



IH I I SOO/3150*) 



L-KT 



<2 11- misc fed:u:e 
■;2 2 3"- Novel Sequence 



.-ii'C' 131: 

31 y Ala Thr Cys Ala Ala Gly Cys Thr Thr Cys Cys Ala Thr Gly Gly 
1 5 1 0 15 

3ys C.y Thr Gly Cys Thr Gly Cys Cys Thr Gly Ala Gly Cys Gly Ala 

20 25 30 

Gly Gly Ala Gly 
3 5 

210 - J 33 

211 - 5 3 

212 * PRT 

213 > Artificial Sequence 
220> 

221'* mis cofeature 
223> Novel Sequence 



•AQQ:- 233 

Gly Ala Thr Cys Gly Gly Ala Thr Cys Cys Thr Thr Ala Glv Ala Ala 
1 5 10 15 

Cys Ala Gly Gly Cys Cys Gly Cys Ala Gly Thr Cys Cys Thr Thr Cys 
20 25 30 

Ala Gly Gly Thr Thr Cys Ala Gly Cys Thr Gly Cys Ala Gly Gly Ala 
3 5 4 0 4 5 

Thr Gly Gly Thr Gly 

5 0 



Page 74 



( i: > im i kn vi iowi utik v'i' ion rt in ism n i ndfr i hi r vi i vi ( oon r vi ion tri vi tin 1 » 



( I 1 ) | World lnu lkH lu.il Property Organization 

Inter "iiiii 1* >nal Htn cm 

(43) International l*u blication Date 
25 M;i\ 2001 (25.05.2001 ) 



PCT 



(10) lnicrii;i(ional Publication Nuinhcr 

WO 01/36471 A3 



(rT) International Patent ( .'lassifieatioii : 

CI2\ I VI 2 



( 07K N/705 



o< t/242. -1 > 
oo.'24 >.i ) F> 



2(> t >ctohcr 2ooo i 2*i. |o.2< kio- 1 S 
24 ( >e(obcr 20(H) i 24 |o.2ooih ( S 



(21) International Application Number: PC 1 71 S( )0/5 1 5< >9 



( — ) 


International Filing Dale: 














1 f 1 \m\ 


•nib 'i 


"•(MX) 


1 ( , j ] 


2oot h 


\ — ^ t 


Kiliii 1 * Lanjiiia^e: 








I- 


1 1 l4 I S 1 1 




Publication Laii" 


u a«c: 






p 


io'IisIi 


(30) 


Priori! \ Data: 














60/166.088 


17 No\ ember 


1 uug 


17.11 


[990) 


IS 




60/166.099 


1 7 November 


1 999 ( 


1 7. 1 1 


1 999 ) 


IS 




60/166,569 


17 \i< >\ LMlllVt 


1 99U < 


1 7. 1 1 


1 999 ) 


I S 




60/1712)00 


25 Deeembei 


1999 


23 12 


I 999) 


IS 




60/171.901 


25 Dt'tvinhi'i 


1 OO W 


2 5. 12 


I 99W ) 


us 




60/171 .902 


2 ; 1 )L\VllltK J l 


I 99W 


2.5.12 


1 999 ) 


is 




60/ IK 1.749 


1 1 February 


20O0 


1 1 .02 


2000 t 


us 




60/189.258 


14 Mauli 


2iK)0 


14.05 


2000 ) 


us 




60/189.259 


14 March 


2(KH) 


14.03 


2000) 


us 




60/ 195. 8 98 


10 April 


2(KK) 


10.04 


2(MHb 


us 




60/195.899 


10 April 


2000 ( 


10.04 


20(H)) 


us 




60,496.078 


10 April 


2(HH)i 


10.04 


20( )0) 


us 




60/2(H>.4 19 


28 April 


2000 < 


28. 04 


20( MO 


us 




60/203.630 


12 VI ay 


2000 ( 


1 2.05 


2000) 


us 




60/210.741 


12 June 


2000 < 


1 2.06 


20( )()) 


us 




60/210.982 


12 June 


2000 < 


1 2.06 


2000) 


us 




60/226.760 


21 August 


20( >0 


2 1.08 


20( >0) 


us 




60/235.4 IS 


26 September 


20( Ml 


26.09 


2000) 


us 




00/255.779 


26 September 


2(H)0 


26.09 


2000 ) 


us 




6( 1/242.332 


20 ( )etober 


20O0 


20. 10 


20(H)) 


us 



(71) Applicant (tor ail t/ewe^z/et/ Suites ( 'v, e/V I 'Sr. A RUN A 
PHARMACEUTICALS. INC. jUS/US|: 0lo6 Nancv 
kubje Drive. San Diceo. OA 92121 tUS) 

(72) Inventors; and 

(75) Inventors/Applicants ttor I S onivs: CHKN, Ruopin*i 

| UN/US |; 5296 Timber Kranch Way. San Diceo. CA 
92150 iPSi l)\N(i, llutmji. T. 1 1 S/l "S |. V^52 Oak Park 
Drive. San Dieea*. G\ 92Io5 iUS». I OW1TZ, Kevin. I*. 

jUS/USJ; SO.^I Uainmito dc Pi/./a #C. San Dice.o. ( A 
82108 (US). 

(74) Agents: MILLER. Suzanne, L. et al.: Woodcock Wash- 
burn Kurt/ Maekieuie/ .V N orris ULP. ( >ne Liberty Place - 
4oth Floor. Philadelphia. PA 19|05 (US) 

(SI) Designated Stales nhuioiuth*. AE. AG. A I.. AM. AT. AU 
AZ. HA. HH. RG. MR. BY. HZ. CA. CM. CN. CR. CU. CZ. 
Dli. DK. DM. DZ. EE. FS. Fl. GH. GD.GF. GH.GM. MR 
HI". ID. IF. IN. IS. JR. KF. KG. KP. KR. KZ. FC. LK. LR 
FS. LT. FU. FY. MA. Ml). MG, MK. MN. MW. MX. MZ 
N( >. NZ. PL. l^T. R( ). RF. SI). SE. SG. SF SK. SF. TJ. I\M 
TR. IT. TZ. U A. FG. FS. UZ. VN. VU. ZA. ZW 

(S4) Designated Stales uxyumah: ARIPO patent (GH. GM 
KF, FS. MW. MZ. SD. SF. SZ. TZ. UG, ZWi. Eurasian 
patent t AM. AZ. in'. KG. KZ. Ml ). RU.TJ.TM). European 
patent (AT. HE. CI F CY. DF. t)K. FS FF FU. GH. GR. IE 
IT. LU. MC. NL. PT SE. TR k OAPl patent <BF. HJ. CF 
CG. CF CNF GA. GN. GW ML MR \F. SN.TD.TG). 

/( 'on (tunc J on ncx! f>tigc' 



(54) Title: ENDOGENOUS AND NON-ENDOGENOUS VERSIONS OF HUMAN G PROTFI N-( ( )UPFFI > RECEPTORS 



< 



sO 
O 

O 



4000-, 



IP3 Assay in 293 Cells 




CMV 



RUP12 



f?" 7 ) Abstract: The m\ention iliselo>ed in thr. 
patent document relate*, to traiiMiiemhrane 
reeeptors. more particularly to a human 
G jiroiein-eoupleLl receptor tor which the 
endogenous li^and is uiiknov\ti ("orphan 
GPCR receptors"), and most particularly it* 
mutated on m-endoyenou^ ) \cimo?i- ot the 
human GPCRs tor evidence i>t constitutive 
acliv its. 



WO 01/36471 A3 II II III 1 111 111 II 111 I III 111 II 

Published: l<>r iw<> . . /,•< < > . - ,uui < >iih-r t ihh/\-\iuf /<>//v i\-u--' h> i/'h "<//(,-./- 

wilts niU-i'ininotnu >\-i>ari uihe \t*h - <>>.>< < >Jt ^ w//7 I h/vcviain >ns " df >/vw<"/ //;_' ,// Hir 

nm^ <>! l\u h ?\ >-jjiiii>- issue >>i ilw }'( 7 ( siizcite 

(88) Dale of pu hlicalion of the internatioitnt search report: 



INTERNATIONAL SEARCH REPORT 



intei jnal Application No 

PCT/US 00/31509 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 7 C07K14/705 C12N15/12 



According to international Patent Classification (IPO or to both national classification and IPt 



B FIELDS SEARCHED 



Minimum documenlahon searched (ciassiitcaiion system followed Dv classification symbols i 

IPC 7 C07K 



fxjajmentalion searched other than minimum uocumentaiton 1o the extent that such oocumenis are included in trie lields searched 



Electronic data base consulted during the inlernahonai search (name ol data base and where practical searcn terms used) 

EPO-Internal , EMBL , STRAND , WPI Data. EMBASE. CHEM ABS Data, MEDLINE . PAJ, BIOSIS 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category ° 


Citation ot document wrth indication, wtiere appropnate. ot the relevant passages 


Relevant lo claim No 


X 


WO 98 46620 A (MILLENNIUM PHARM INC) 


1-4 




22 October 1998 (1998-10-22) 






claim 1; figures 1A,,2A; example 8 




X 


US 5 891 720 A (WOOLF ELIZABETH A ET AL) 


1-4 




6 April 1999 (1999-04-06) 






Sequence No. 2 






abstract 






-/-- 





Further documents are listed in me continuation ot box C 



Patent lamiiy members are listed in annex 



° Special categories of a ted documents 

'A' document defining the oeneral state of the art which is not 

considered to be of particular relevance 
"E" earber document but published on or atler the international 

tiling date 

"L" document which may throw doubts on pnontv claimis) or 
which ts crted to establish the publication date ot another 
citation or other special reason (as specified) 

"O" document referring lo an oral disclosure use exhibition or 
other means 

*P' document published poor lo the international filing dale but 
later than the pnonty dale claimed 



"T* laier document published after The international tiling date 
or pnoriv date and not in conflici with the application but 
cited to unoerstand the principle or theory underlying the 
invention 

"X" document ol particular relevance the claimed invention 
cannot be considered novel or cannot be considered to 
involve an invenlrve step when Ihe document ts taken alone 

"Y* document of particular relevance; the claimed invention 

cannot be considered to involve an invenlrve step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious lo a person stalled 
in ihe art 

*&* document member of the same patent family 



Dale of the actual completion of the international search 

28 August 2001 



Uate ot mailing of the international search report 

I 9.09. 01 



Name and mailing address of ihe ISA 

European Patenl Office P B 5818 Paientlaan 2 
NL - 2280 HV Rnswiik 

TeL (+31-70) 340-2040 Tx 31 661 epo nl. 
Fax: (+31-70) 340-3016 



Authorized officer 



Meyer, W 



Form PC T /ISA/210 (second tfiMiM July 1992) 



INTERNATIONAL SEARCH REPORT 



Inter jnal Application No 

PCT/US 00/31509 



C. (Continue 


ation) DOCUMENTS CONSIDERED TO BE RELEVANT 


Category " 


Grtaiion ot document wiln indication wne re appropriate of the relevant passages 


Hetevanl 10 claim No 



STADEL J M ET AL: "Orphan G 

protein-coupled receptors: a neglected 

opportunity for pioneer drug discovery" 

TRENDS IN PHARMACOLOGICAL 

SCIENCES, GB. ELSEVIER TRENDS JOURNAL, 

CAMBRIDGE , 

vol. 18, no. 11, 

1 November 1997 (1997-11-01), pages 
430-437, XP004099345 
ISSN: 0165-6147 
abstract; table 1 

K JELSBERG M A ET AL: "Constitutive 
activation of the alphalB-adrenergi c 
receptor by all amino acid substitutions 
at a single site" 

JOURNAL OF BIOLOGICAL CHEMISTRY .AMERICAN 
SOCIETY OF BIOLOGICAL CHEMISTS, BALTIMORE , 
MD,US, 

vol . 267, no. 3, 

25 January 1992 (1992-01-25), pages 
1430-1433, XP002135768 
ISSN: 0021-9258 
abstract 

O'DOWD B F ET AL: "DISCOVERY OF THREE 
NOVEL G-PR0TEIN-COUPLED RECEPTOR GENES" 
GENOMICS , ACADEMIC PRESS, SAN DIEGO, US, 
vol. 47, no. 2, 

15 January 1998 (1998-01-15), pages 
310-313, XP000863786 
ISSN: 0888-7543 
abstract 

W0 97 21731 A (NEW ENGLAND MEDICAL CENTER 
INC) 19 June 1997 (1997-06-19) 
page 18, line 18-26; figures 2,3 

MARCHESE A ET AL: "Novel GPCRs and their 

endogenous ligands: expanding the 

boundaries of physiology and pharmacology" 

TRENDS IN PHARMACOLOGICAL 

SCIENCES, GB, ELSEVIER TRENDS JOURNAL , 

CAMBRIDGE , 

vol . 20, no. 9, 

1 September 1999 (1999-09-01), pages 
370-375, XP004178194 
ISSN: 0165-6147 
abstract 

-/-- 



1-4 



1-4 



1-4 



1-4 



1-4 



Form PCT/ISA/210 (continuation ot saconosneoll (July 1992) 



6 



INTERNATIONAL SEARCH REPORT 


Intet jnal Application No 

PCT/US 00/31509 


C. (Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT 


Caieoory *■ 


Citation ot document with indication wnere appropriate ot ihe relevant passages 


Relevant to claim No 


P.X 


DATABASE EMBL 'Online! 

Arroc ci nn Kir* HQNTTn 

1 October 2000 (2000-10-01) 

COLLIER R . : "DJ680N.3 (G-Protein Coupled 

Receptors ) ( Fragment )" 

XP002168498 

abstract 




1-4 


P.X 


WU UU Ltl Jl n vMrVLIMrt r nMI\t IMLLU 1 J.UMLJ 1 IXU 

;60RE MARTIN (US); LIAW CHEN W (US); LIN) 
20 April 2000 (2000-04-20) 
the whole document 




1-4 


V 

A 


DATABASE EMBL 'Online' 

AC: AC008728, 4 August 1999 (1999-08-04) 

DOE JOINT GENOME INSTITUTE: "Sequencing of 

Human Chromosome 5" 

XP002175776 

aUSLr dCl 




5-8 


A 


WO 98 29439 A (SULLIVAN KATHLEEN ;MERCK & 
CO INC (US); TAN CARINA (US)) 

Q Tiilw 1 QQ£ MQQft-D7-nQ^ 

y uu i y i?yo v I-?-? o u/ j 

page 57; figure 13; example 14 




r _o 
D O 


E 


WO 01 14577 A ( SMITHKLINE BEECHAM PLC 
; SMITHKLINE BEECHAM CORP (US)) 
1 March 2001 (2001-03-01) 
page 30-31; claims 1,2 




5-8 


E 


rp i nan qqq a fPFTyrp i m -PFT7PP fiKU 
11 April 2001 (2001-04-11) 
Seq. Id. No. 1, 2 




5-8 


X 


DATABASE EMBL 'Online! 

AC: AC008754. 4 August 1999 (1999-08-04) 

DOE JOINT GENOME INSTITUTE: "Homo sapiens 

chomosome 19 clone CTD-3023J11, complete 

sequence" 

XP002175778 

abstract 




9-12 


X 


DATABASE EMBL 'Online! 

AC: AQ532303, 18 May 1999 (1999-05-18) 

ZHAO S ET AL.: "Use of BAC End Sequences 

from Library RPCI-11 for Sequence-Ready 

Map Building" 

XP002175779 

abstract 




9-12 



form PCT/tSA/2 10 |cantr>ua«ion ot sooona sr*»et i (July 1992) 



page 3 of 9 



INTERNATIONAL SEARCH REPORT 



Intei onal Application No 

PCT/US 00/31509 



C.(Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT 


Category - 


Citation ot document, wrih indica lion, where appropriate, ot me relevant passaqes 


Hetevant to claim No 


P,X 


DATABASE EMBL 'Online! 


9-12 




AC: AB033237, 4 May 2000 (2000-05-04) 






0H0N0 ET AL.: "Homo spiens mRNA for G 






protei ne-coupl ed receptor C5L2, complete 






cds" 






XP002175947 






abstract 




P,x 




WO 00 14229 A (0N0 MITSUHARU ;KANN0 


9-12 




KIMIYOSHI (JP); ASAHI CHEMICAL IND (JP); 






ISHI) 16 March 2000 (2000-03-16) 






page 98 -page 101; claim 4 






page 103; cl aim 5 




E 


— . — 

W0 01 36471 A (ARENA PHARMACEUTICALS INC 


5-28, 




; CHEN RUOPING (US); DANG HUONG T (US); L) 


41-52 




25 May 2001 (2001-05-25) 






Seq. Id. No. 3 (claims 5-8) 






Seq. Id. No. 5 (claims 9-12) 






Seq. Id. No. 7 (claims 13-16) 






Sea Id No 9 (claims 17-20) 






Seq. Id. No. 11 (claims 17-20) 






Seq. Id. No. 13 (claims 25-28) 






Seq. Id. No. 21 (claims 41-44) 






Seq. Id. No. 19, 23 (claims 45-48) 






Sea ID No 25 (claims 49-52} 




E 


EP 1 094 076 A (PFIZER LTD ;PFIZER (US)) 


9-12 




25 April 2001 (2001-04-25) 






Seq. Id. No. 1 






E 


W0 01 31014 A (UPJOHN CO ;V0GELI GABRIEL 


13-16 




(US); WOOD LINDA S (US); MERCHANT KALPAN) 






3 May 2001 (2001-05-03) 






Sequence No. 5 




X 




PATENT ABSTRACTS OF JAPAN 


13-16 




vol . 1999, no. 09, 






30 July 1999 (1999-07-30) 






& JP 11 098988 A (SMITHKLINF RFFfHAM 






CORP), 13 April 1999 (1999-04-13) 






abstract 






& DATABASE EMBL 'Online! 






AC* E31720- E75225 






22 February 2001 (2001-02-22) 






JEFFREY L.M.D.D. AND BERGSUMA W.S.H.H.: 






"cDNA clone HeoAd54 encoding human 






seven-pass transmembrane receptor" 






abstract 




X 


US 5 955 308 A (BERGSMA DERK J ET AL) 


13-16 




21 September 1999 (1999-09-21) 






Sequence 1 






-/-- 





Form PCT/lSA/210 (continuation ot 



sheet) (July 1992) 



INTERNATIONAL SEARCH REPORT 


inte onai Application No 

PCT/US 00/31509 


C.(Contlnuationi DOCUMENTS CONSIDERED TO BE RELEVANT 


Category 


Oilalton ot document wilh inclicanon where appropriate o1 Ute relevant passacjob 


( Hetevani lo claim Nt ■ 


X 


EP 0 892 051 A ( SMITHKLINE BEECHAM CORP) 
20 January 1999 (1999-01-20) 
page 20 -page 21; claim 11 




13-16 


X,P 


DATABASE EMBL 'Online! 

AC: AP000808, 3 December 1999 (1999-12-03) 

HATTORI M. ET AL. : "Homo sapiens 171,539 

genomic of llqli 

XP002175780 

abstract 




17-20 


X 


WO 99 32519 A (FORTIN YVES ; LEMB0 PA0LA 
(CA); AHMAD SULTAN (CA); BANVILLE DENIS () 




37-40 


A 

X 
X 


page 48 -page 54; claims 16,20.21 

page 55 -page 56; claim 25 
page 52 -page 54; claim 21 




17-20. 
57-60 
37-40 
37-40 


X 


DATABASE EMBL 'Online! 

AC011780 , 18 October 1999 (1999-10-18) 

BIRREN B., LINTON L., NUSBAUM C, LANDER 

E . : "Homo sapiens clone RP11-15H8, 31 

unordered pi eces . " 

XP002175781 

abstract 




21-24 


X 


JP 08 245697 A (TAKEDA CHEM IND LTD) 
24 September 1996 (1996-09-24) 
claim 4 ; f i gures 1 ,2 




21-24 


X 


W0 96 05302 A (FUJI I RYO ;H0S0YA MASAKI 
(JP); 0HGI KAZUHIR0 (JP); FUKUSUMI SH0JI ) 
22 February 1996 (1996-02-22) 
page 263 -page 264; example 16 




21-24 


P,X 


DATABASE EMBL 'Online! 

AC: AL355310 , 5 May 2000 (2000-05-05) 

WALLIS, J: "Human DNA sequence from clone 

RP5-1160K1" 

XP002175782 

abstract 




21-24 


X 


DATABASE EMBL 'Online! 
AC: AOO01459, 

ADAMS M.D. ET AL. : "CIT-HSP-2286K19 . TF 
CIT-HSP Homo sapiens genomic clone 
2286K19, genomic survey sequence" 
XP002175783 
abstract 




25-28 


A 


EP 0 878 542 A (SMITHKLINE BEECHAM CORP) 
18 November 1998 (1998-11-18) 
page 18 -page 19; claim 1 

-/- 




25-28 



Form PCT /ISA/210 (contmuahon o* «*cor*a sfteet) (Juty 1 992 



pag^^o^9 



INTERNATIONAL SEARCH REPORT 



Inter onai Application No 

PCT/US 00/31509 



C.iContmuationj DOCUMENTS CONSIDERED TO BE RELEVANT 



(, nation ol document wilh indication where appropriate or the relevant passages 



I Hetevant to claim No 



P.X 



P,X 



P,X 



HEISE CHRISTOPHER E ET AL: 
"Characterization of the human cysteinyl 
leukotriene 2 receptor." 
JOURNAL OF BIOLOGICAL CHEMISTRY , 
vol. 275, no. 39, 

29 September 2000 (2000-09-29), pages 

30531-30536, XP002175775 

ISSN: 0021-C258 

the whole document 

& DATABASE EMBL 'Online! 

AC: AF254664, 

HEISE C.E. ET AL . : "Homo sapiens cysteinyl 
leukotriene receptor CYSLT2 gene, complete 
cds. " 
abstract 



US 5 861 309 A ( WEINSHANK RICHARD L 
AL) 19 January 1999 (1999-01-19) 
Sequence 1 



ET 



W0 01 09184 A (DELEERSNI JDER WILLY ;NYS 
GUY (BE); ZHANG FAN (BE); S0LVAY PHARMACE) 
8 February 2001 (2001-02-08) 

Sequence 1 

page 6 -page 7; claims 1,15 

DATABASE EMBL 'Online! 

AC016468, 1 December 1999 (1999-12-01) 

BIRREN B. ET AL. : " Homo sapiens clone 

RP11-14N15" 

XP002175784 

abstract 

WO 99 48921 A (ORGANON NV ;SPEK PETRUS 
JOHANNES V D (NL); UNIV LELAND STANFORD 
JU) 30 September 1999 (1999-09-30) 
claims 2,4; figure 4 

DATABASE EMBL 'Online! 

AL136106, 7 January 2000 (2000-01-07) 

BURTON J: "Human DNA sequence from clone 

RP11-15909" 

XP002175785 

abstract 

DATABASE EMBL 'Online! 
AC: AC008547, 

0E JOINT GENOME INSTITUTE STANFORD HUMAN 
GENOME CENTER.: "Homo sapiens chromosome 5 
clone CTC-502M5, complete sequence." 
XP002175786 
abstract 



25-28 



29-32 



29-32 



29-32 



33-36 



33-36 



41-44 



Form PCT/lSA/210 (contnualion ol second sneot ) (July 1992) 



INTERNATIONAL SEARCH REPORT 


Inte .onal Apphcatton No 

PCT/US 00/31509 


C.(Conli mation) DOCUMENTS CONSIDERED TO BE RELEVANT 


Caiegor 


Citation ol document, wilfi indication wriere appropriate ot trie retevam passages 


H^tevanl lo claim No 


A 


WO 99 06552 A (GENSET SA :LACR0IX BRUNO 
(FR); DUCLERT AYMERIC (FR): DUMAS MILNE E) 
11 February 1999 (1999-02-11) 
SEQ ID NO: 95 




41-44 


A 


EP 0 612 845 A (AMERICAN CYANAMID CO) 
Ji August iyy4 ( iy94-08-ii ; 
claim 2; figure 9 




49-52 


A 


DATABASE EMBL 'Online! 
AC: AL065769, 29 May 1999 (1999-05-29) 
GEMPSCPUE: "Drosophila melanogaster genome 
survey sequence TET3 end of BAC # 

n x pr>nni/i n .c n r* /*" T r\ n ~i l- — . - r . ,. - — 

Dtti>r\uoMu ui i\r oi~7o i tut ai y i i uui 

Drosophila melanogaster (fruit fly)" 

XP002175910 

abstract 




49-52 


P,X 


DATABASE EMBL 'Online! 

AC: AH61458, 16 April 2000 (2000-04-16) 

BURTON J. ET AL.: "Human DNA sequence from 

clone RP11-163L4" 

XP002175911 

abstract 




49-52 


A 


B0YER JOSE L ET AL: "Molecular cloning 
and expression of an avian G 
protein-coupled P2Y receptor." 
MOLECULAR PHARMACOLOGY, 
vol. 52, no. 6, December 199/ (1997-12), 
pages 928-934, XP002175907 
ISSN: 0026-895X 
the whole document 




53-56 


P.X 


DATABASE EMBL 'Online! 

AC: AC026756, 24 April 2000 (2000-04-24) 

ABOLA A. P. ET AL.: "omo sapiens chromosome 

13 clone RP11-286P8, complete sequence" 

XP002175912 

abstract 




53-56 


P,X 


DATABASE EMBL 'Online! 

AC AC027026, 27 April 2000 (2000-04-27) 

BIRREN B. ET AL.: "Homo sapiens chromosome 

11, clone RP11-589F4" 

XP002175913 

abstract 




57-60 


r 


W0 01 16159 A (SMITHKLINE BEECHAM CORP) 
8 March 2001 (2001-03-08) 
page 30; claim 1 

-/-- 




57-60 



Form PCT1SA/210 (eonimuation ot s»cooa tnoeli <Juty 1992 



aae 7 of 9 



pao 



INTERNATIONAL SEARCH REPORT i lnle a pp"""oh No 

! PCT/US 00/31509 



C.i Conunuauon j uucuMtN f b UUNblDERED TO BE RELEVANT 



Cdteqory - 


uiidhon oi (Jocumeni witn indication. wht,-rf- a p propria! e l<\ U.e relevant pdssaues 


R«.-tevant to claim No 


Y 

A 


DATARA^F PMRI 'nnlino! 


o 1-54 




AC: AC007104, 23 April 1999 (1999-04-23) 






STONE ET AL.: "Homo sapiens chomosome 4, 






1 f\ i in ArHflr£>rl n i nrn c " 
1U UIIUi Uct cU p 1 tfLcb 






y POO? 1 7^Q1 A 






abstract 




f 

r_ 


WU Ul A t^tKLN rAltlMI blMDrf ,UULLKtK 


61-64 




k' \ A 1 1 c f nr \ \ oo Cdhrinri/ 9nm / onm r\9 oo \ 

NLnU j \UC J ) LC reDrudry £UU1 \ d\J\J l — \J £ — ) 






OtrLJ U tr nut? 1 , £ 






n^np *3 Q • rl aim ^ 




X 


JP 11 032770 A (ASAHI CHEM IND CO LTD) 


65-68 




y reuruary iyyy i iyyy-uz— uy ; 






page 19 ; claim 7 




Y 
A 


Lin OR cAQ?n a (e\ cum idpatv madti a 


69-72 




, orll 1 HKLlNh dLLLHAh CGKr (US); LI XIAOT0NG 






iub; 1/ uecemDer 1998 (1998-12-17) 






page 30 -page 31; claims 1,2 




A 


the whole document 


45-48 


v 
A 


uMiAKAot t nt5 L un 1 1 ne ! 


73-76 




AT * APm DQft/l 






oepLernDer iyyy v iyyy— uy— ^y ) 






WAitKbUN K.H.: Homo sapiens chromosome 2 






clone RP11-510C1" 






XP002175915 






dui> L r dCL 




A 


nATARA^F FMRI 'nnlino! 


/ 6-/0 




AL UUooyZ , lb July 1998 (1998-07-15) 






wt iNbHANK k. H.. b-Hydr oxytrypt am 1 ne IB 






Keceptori -h i -ib ; v serotonin Keceptor ) 






Ar UU£ 1 /by4o 






du 5 L rdCl 




v 

A 


MAUATDAC fDCTADV P n Al . 

MAHAiKAb bKtbUKY b hi AL : 


77-80 




"Sequence-tagged connectors: A sequence 






approach to mapping and scanning the human 






genome . " 






ppAprrnTMPC nr tuc mattamai apahtmv nr 
rKUuttulIMbo Ur 1 hit NA11UNAL AUAUtnY Ur 






oUlcJMLLo Ur 1 lit UnJIItU j 1 Al to , 






vol. y o , no . i / , 






17 August 1999 (1999-08-17), pages 






9739-9744. XP002175909 






Aug. 17, 1999 






ISSN: 0027-8424 






trie wnoie uocument 




E 


WO 01 07606 A (SMITHKLINE BEECHAM PLC) 


77-80 




1 February 2001 (2001-02-01) 






Sequence 2 






page 31; claim 4 











Form PCT/lSA'2iO (connnuation 01 secona sfieeu uuiy 1992) 



Daae 8 of Q 



INTERNATIONAL SEARCH REPORT 



lnt( ional Application No 

PCT/US 00/31509 



C.(Contir>uation) DOCUMENTS CONSIDERED TO BE RELEVANT 



Oalegory " (.Jlaltun of oocument wtlh indication where appropriale ol the relevant passages 



Metevani to claim No 



P,x 



WO 00 49046 A (TERA0 YASUKO ; WATANABE 
TAKUYA (JP); SHINTANI YASUSHI (JP); 
TAKEDA) 24 August 2000 (2000-08-24) 
claim 5: figure 1 



77-80 



Form PCT/1&A/210 (continuation o* seoanasneeli (July 199? 



of 9 



INTERNATIONAL SEARCH REPORT 



Im^, national application No. 

PCT/US 00/31509 
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2. I | Claims Nos.: 

because they relate to Darts of the international Application that do not comply with the prescribed requirements to such 
an extent that no meamngtul International Search can be carried out, specifically: 



3. | [ Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 

Box il Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 

This international Searching Authority found multiple inventions in this international application, as follows: 
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International Application No. PCT/US 00 /S1509 



FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1-4 

G protein-coupled receptor as characterized by SEQ.ID.2, its 
non-endogenous, consti tuti vely activated version, a cDNA 
encoding said receptor as characterized by SEQ.ID.l, a 
plasmid comprising said cDNA, and a host cell comprising 
said plasmid. 



2. Claims: 5-8 

G protein-coupled receptor as characterized by SEG.ID.4, its 
non-endogenous, consti tuti vely activated version, a cDNA 
encoding said receptor as characterized by SEQ.ID.3, a 
plasmid comprising said cDNA, and a host cell comprising 
said plasmid. 



3. Claims: 9-12 

G protein-coupled receptor as characterized by SEQ.ID.6, its 
non-endogenous, consti tut i vel y activated version, a cDNA 
encoding said receptor as characterized by SEG.ID.5, a 
plasmid comprising said cDNA, and a host cell comprising 
said plasmid. 



4. Claims: 13-16 

G protein-coupled receptor as characterized by SEQ.ID.8, its 
non-endogenous, consti tuti vely activated version, a cDNA 
encoding said receptor as characterized by SEQ.ID.7, a 
plasmid comprising said cDNA, and a host cell comprising 
said plasmid. 



5. Claims: 17-20 

G protein-coupled receptor as characterized by SEQ.ID.10, 
its non-endogenous, consti tut i vely activated version, a cDNA 
encoding said receptor as characterized by SEQ.ID.9, a 
plasmid comprising said cDNA, and a host cell comprising 
said pi asmid. 



6. Claims: 21-24 

G protein-coupled receptor as characterized by SEQ.ID.12, 
its non-endogenous, consti tuti vely activated version SEQ 
ID. 84, a cDNA encoding said receptor as characterized by 
SEQ. ID. 11, a plasmid comprising said cDNA, and a host cell 
comprising said plasmid. 
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7. Claims: 25-28 

G protein-coupled receptor as characterized by SEQ.ID. 14, 
its non-endogenous, consti tuti vely activated version SEQ.ID. 
88, a cDNA encoding said receptor as characterized by 
SEQ.ID. 13, a plasmid comprising said cDNA, and a host cell 
comprising said plasmid. 



8. Claims: 29-32 

G protein-coupled receptor as characterized by SEQ.ID. 16, 
its non-endogenous, consti tuti vely activated version 
SEQ.ID. 92, a cDNA encoding said receptor as characterized by 
SEQ.ID. 15, a plasmid comprising said cDNA , and a host cell 
comprising said plasmid. 



9. Claims: 33-36 

G protein-coupled receptor as characterized by SEQ.ID. 18, 
its non-endogenous, consti tuti vely activated version, a cDNA 
encoding said receptor as characterized by SEQ.ID. 17, a 
plasmid comprising said cDNA, and a host cell comprising 

f ^ "i rl r\ 1 -j c* m -i /H 
iu (J t QJlM I U . 



10. Claims: 37-40 

G protein-coupled receptor as characterized by SEQ.ID. 20, 
its non-endogenous, constituti vely activated version, a cDNA 
encoding said receptor as characterized by SEQ.ID. 19, a 
plasmid comprising said cDNA, and a host cell comprising 
said plasmid. 



11. Claims: 41-44 

G protein-coupled receptor as characterized by SEQ.ID. 22, 
its non-endogenous, constituti vely activated version, a cDNA 
encoding said receptor as characterized by SEQ.ID. 21, a 
plasmid comprising said cDNA, and a host cell comprising 
said plasmid. 



12. Claims: 45-48 

G protein-coupled receptor as characterized by SEQ.ID. 24, 
its non-endogenous, consti tutively activated version, a cDNA 
encoding said receptor as characterized by SEQ.ID. 23, a 
plasmid comprising said cDNA, and a host cell comprising 
said plasmid. 
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13. Claims: 49-52 

G protein-coupled receptor as characterized by SEQ.ID. 26, 
its non-endogenous, constituti vely activated version, a cDNA 
encoding said receptor as characterized by SEQ.ID. 25, a 
plasmid comprising said cDNA, and a host cell comprising 
said plasmid. 



14. Claims: 53-56 

G protein-coupled receptor as characterized by SEQ.ID.28, 
its non-endogenous, constituti vely activated version, a cDNA 
encoding said receptor as characterized by SEQ.ID. 27, a 
plasmid comprising said cDNA, and a host cell comprising 
said plasmid. 

15. Claims: 57-60 

G protein-coupled receptor as characterized by SEQ.ID. 30, 
its non-endogenous, const i tut i vely activated version, a cDNA 
encoding said receptor as characterized by SEQ.ID.29, a 
plasmid comprising said cDNA, and a host cell comprising 
said plasmid. 



16. Claims: 61-64 

G protein-coupled receptor as characterized by SEQ.ID. 32, 
its non-endogenous, constituti vely activated version, a cDNA 
encoding said receptor as characterized by SEQ.ID. 96, a 
plasmid comprising said SEQ.ID 95, and a host cell 
comprising said plasmid. 



17. Claims: 65-68 

G protein-coupled receptor as characterized by SEQ.ID. 34, 
its non-endogenous, constituti vely activated version, a cDNA 
encoding said receptor as characterized by SEQ.ID. 33, a 
plasmid comprising said cDNA, and a host cell comprising 
said plasmid. 



18. Claims: 69-72 

G protein-coupled receptor as characterized by SEQ.ID. 36, 
its non-endogenous, constituti vely activated version, a cDNA 
encoding said receptor as characterized by SEQ.ID. 35, a 
plasmid comprising said cDNA, and a host cell comprising 
said plasmid. 
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19. Claims: 73-76 

G protein-coupled receptor as characterized by SEQ.ID.38, 
its non-endogenous, consti tut i vely activated version, a cDNA 
encoding said receptor as characterized by SEQ.ID.37, a 
plasmid comprising said cDNA, and a host cell comprising 
said plasmid. 



20. Claims: 77-80 

G protein-coupled receptor as characterized by SEQ.ID.40, 
its non-endogenous, constituti vely activated version, a cDNA 
encoding said receptor as characterized by SEQ.ID.39, a 
plasmid comprising said cDNA, and a host cell comprising 
said plasmid. 
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